Preface


Every reader will find something of interest in this book: from superdiffusion
of the ocean surface to fetal heartbeats, from solar wind to cellular automata, from
radioactive contamination to texture analysis, from image rendering to neural
development. The all-pervading link connecting these disparate disciplines is the
realization that a linear approach to the majority of natural processes is at best
only an approximation, and one that can frequently be downright misleading.
Consequently, what is broadly called the theory of complexity has gained tremendous
momentum in the last decade or two. This modern approach aims at, and frequently
succeeds in, correctly explaining many natural processes.

Papers in this book are based on presentations at the 6th International Conference
exploring the above-mentioned issues. The conferences are now regular and
well established among the nonlinear series of conferences. This conference series
is organized in different geographical regions to encourage international
collaboration. Among the distinguishing features of this series is its
multidisciplinary nature, which has been growing steadily.

There  are  two  papers,  based  on  the  invited  talks  by  the  eminent  authorities  in 
their  respective  fields,  L.  O.  Chua  (USA)  and  H.  Takayasu  (Japan). 

The  Fractal  2000  conference  was  partially  supported  by  the  Department  of  the 
Navy  Grant  N00014-00-1-1003  issued  by  the  Office  of  Naval  Research  International 
Field  Office.  The  conference  was  also  made  possible  through  the  generous  help  of 
the  following  members  of  the  program  committee  (in  alphabetical  order): 

D. Avnir (Israel), R. C. Ball (UK), A. Coniglio (Italy), M. Daoud (France),
C. J. G. Evertsz (Germany), K. Falconer (UK), J. Feder (Norway), J.-F. Gouyet (France),
P. Grassberger (Germany), R. A. M. Gregson (Australia), M. H. Jensen (Denmark),
R. Kapral (Canada), M. S. Keane (Netherlands), C. M. Kim (Korea), J. Klafter (Israel),
J. Kowalski (USA), G. Radons (Germany), D. Saupe (Germany), H. E. Stanley (USA),
W. H. Steeb (South Africa), H. Takayasu (Japan), C. Tsallis (Brazil),
E. R. Vrscay (Canada), J. Zhang (P. R. of China)

The details about the next conference in this series will be posted on the website
http://www.kingston.ac.uk/fractal/.


M.  M.  Novak 

Kingston-upon-Thames, UK
THE ORIGIN OF COMPLEXITY

LEON O. CHUA

Department of Electrical Engineering and Computer Sciences,
University of California at Berkeley,
Berkeley, CA 94720, USA
E-mail: chua@eecs.berkeley.edu

Nature  abounds  with  complex  patterns  and  structures  emerging  from  homogeneous 
media  operating  far  from  thermodynamic  equilibrium.  Such  phenomena,  which  are 
widely  observed  in  both  inanimate  (non-biological)  and  biological  media,  can  be 
modeled  and  studied  via  the  CNN  (Cellular  Neural/Nonlinear  Network)  paradigm 
in an in-depth and unified way. Whether a homogeneous medium is capable of
exhibiting complexity depends on whether the CNN cells, or their couplings, are
locally active in a precise mathematical sense. This local activity principle is of
universal generality and is responsible for all symmetry-breaking phenomena
observed in a great variety of non-equilibrium media, ranging from the emergence of
negative differential conductance in bulk semiconductor materials (e.g., gallium
arsenide in Gunn diodes) to the emergence of artificial life itself. The main
result of this paper consists of a set of explicit analytical conditions for calculating
the  parameter  ranges  necessary  for  the  emergence  of  a  non-homogeneous  static  or 
dynamic  pattern  in  a  homogeneous  medium  operating  under  an  influx  of  energy 
and/or  matter.  The  resulting  “complexity  related”  inequalities  are  applicable  to 
all  media,  continuous  or  discrete,  which  have  been  mapped  into  a  CNN  paradigm. 


One of the most interesting aspects of the world is that it can be
considered to be made up of patterns. A pattern is essentially
an arrangement. It is characterized by the order of the elements
of which it is made rather than by the intrinsic nature of these
elements.


Norbert  Wiener 


1  Introduction 

How  does  the  leopard  get  its  spots?  How  does  the  zebra  get  its  stripes?  How  does 
the  fingerprint  get  its  patterns?  How  does  an  ant  colony  manage  to  self-organize 
into  an  impressive  pattern  of  activities  when  individual  ants  are  known  to  be  quite 
dumb?  How  does  an  initially  mixed  distribution  of  black  and  white  population  in 
a  housing  community  manage  to  redistribute  over  time  into  segregated  black  and 
white  neighborhoods  with  well-defined  boundaries? 

How  does  a  bee  sting  at  a  finger  tip  trigger  the  propagation  of  an  electrical 
impulse  to  the  brain  of  a  healthy  person,  and  how  does  this  distress  signal  fail 
to propagate in the nerves of patients suffering from multiple sclerosis? How do
some members of a colony of starving amoebae send out target and spiral wave
signals to attract neighboring amoebae and transform them into spores, and then
regenerate into amoebae again when food (bacteria) becomes available? How does a
“scroll  wave”  get  generated  in  the  cardiac  muscle  by  the  inadvertent  presence  of  an 
electrical  impulse  during  a  vulnerable  window  of  a  few  milliseconds,  often  leading 
to  sudden  cardiac  death? 
The above phenomena are some manifestations of a multidisciplinary paradigm
called emergence, or complexity. They share a common unifying principle
characteristic of dynamic arrays, such as cellular neural networks, namely,
interconnections of a sufficiently large number of simple dynamical units, which can
exhibit extremely complex, synergetic, and self-organizing behaviors. The common
denominator in all of these pattern formation and active wave propagation phenomena
is the presence of an active medium, powered by a constant supply of external energy.
For example, in the brain, the active medium is provided by a sheet-like array of
massively interconnected excitable neurons whose energy comes from the burning of
glucose with oxygen. In cellular neural networks, the active medium is provided
by the local interconnections of active cells, whose building blocks include active
nonlinear devices (e.g., CMOS transistors) powered by batteries.

Research on emergence and complexity has gained immense momentum during
the past decade [1-15]. The fundamental problem is to uncover nature's secret
mechanisms which are responsible for the self-organization and spontaneous emergence
of many stable complex (static and dynamic) patterns^a in homogeneous media
operating far from thermodynamic equilibrium [16]. Indeed, nature abounds with all
sorts of patterns, ranging from regular snowflakes to chaotic brain waves [17,18,19].
Understanding and controlling such patterns is essential for designing new
generations of brain-like molecular devices and systems endowed with artificial
intelligence and self-repair capabilities.

The homogeneous media alluded to above usually consist of an active bulk
medium (e.g., bulk materials with negative resistivity, such as gallium arsenide in
Gunn diodes [20], nerve membranes, heart tissue layers, chemical mixtures in stirred
reactor tanks, etc.) modeled by one or more nonlinear PDE's (partial differential
equations), where the spatial coordinates, as well as the state variables, are
represented by continuous real numbers. What is truly fascinating is that while these
active media are completely unrelated (they can range from inanimate materials
to living tissues [21]), the patterns they exhibit tend to resemble each other under
appropriate initial and boundary conditions. It makes sense therefore to
hypothesize that a common mechanism must be responsible for the emergence of each type
of pattern (e.g., Turing patterns, spiral waves, etc.)^b. This remarkable
observation motivates the development of a unified paradigm capable of exhibiting most,
if not all, static and dynamic patterns (i.e., dissipative structures) in active
homogeneous media operating far from thermodynamic equilibrium. Such a paradigm has
been developed recently and is the subject of a recent treatise [22]. The paradigm is
dubbed the CNN, an acronym for cellular neural networks when used in the context
of brain science, or cellular nonlinear networks when used in other more general
contexts.

A  CNN  is  defined  by  two  mathematical  constructs: 

1. A spatially discrete collection of nonlinear dynamical systems called cells,
where information can be encrypted into each cell via 3 independent variables
called input, threshold, and initial state.

2. An interconnection law relating one or more relevant variables of each cell
$C_{ij}$ to all neighbor cells $C_{kl}$ located within a prescribed sphere of
influence $S_{ij}(r)$ of radius $r$, centered at $C_{ij}$.

Figure 1. A CNN with a cutout view which exposes an inner cell $C_{ijk}$ and its
sphere of influence $S_{ijk}$. Only those cells $C_{\alpha\beta\gamma}$ located
within $S_{ijk}$ are coupled to cell $C_{ijk}$.

^a Such patterns are called dissipative structures by Ilya Prigogine [14] because
energy must be continually supplied and dissipated in order to maintain such structures.

^b For related works on emergence and complexity from different perspectives, the
reader is referred to the very readable expositions by Crick [8], Eigen [10],
Gell-Mann [7], and Prigogine [14].

In the special case where the CNN consists of a homogeneous array, and where
its cells have no inputs, no thresholds, and no outputs, and where the sphere of
influence extends only to the nearest neighbors (i.e., r = 1), the CNN reduces to
the familiar concept of a lattice dynamical system from mathematics.
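The second construct, the sphere of influence, can be sketched in a few lines for a 2-dimensional array. This is only an illustration: the function name `sphere_of_influence` is hypothetical, and the sketch assumes that distance is measured with the maximum (Chebyshev) metric, so that r = 1 gives a 3 x 3 block, and that cells falling outside the array are simply dropped.

```python
def sphere_of_influence(i, j, r, n_rows, n_cols):
    """Return the indices (k, l) of all cells within radius r of cell (i, j).

    Assumption: Chebyshev (max) metric, so r = 1 yields a 3 x 3 block that
    includes the center cell; indices outside the array are discarded.
    """
    return [
        (k, l)
        for k in range(max(0, i - r), min(n_rows, i + r + 1))
        for l in range(max(0, j - r), min(n_cols, j + r + 1))
    ]


interior = sphere_of_influence(2, 2, 1, 5, 5)  # full 3 x 3 neighborhood
corner = sphere_of_influence(0, 0, 1, 5, 5)    # truncated at the array edge
```

Because every cell uses the same rule, one cell description plus one coupling rule is enough to build an array of any size.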

The schematic diagram of a 3-dimensional CNN is shown in Fig. 1, where a
typical cell $C_{ijk}$ is highlighted along with its sphere of influence $S_{ijk}$.
Let us consider some examples.

Example  1.  Emergence  of  3-Dimensional  Knot  Patterns 

Suppose each cell in Fig. 1 consists of a Chua's circuit with three external
terminals (one of them being the ground reference terminal), as shown in Fig. 2. Since
each cell can interact with its neighbors only through the 2 ungrounded terminals,
each one coupled to a corresponding node of a neighbor cell via a resistor, each
ungrounded terminal serves as a port where energy can flow into or out of the cell.
Consequently, the 3-terminal circuit cell in Fig. 2 is also called a 2-port in
electrical engineering. Suppose each cell in Fig. 2 is coupled to its 6 nearest neighbor
cells (two along each coordinate axis) via positive linear resistances, as depicted
in Fig. 2.
[Figure 2 labels: a section of the coupling circuit; Chua's circuit; sphere of
influence $S_{ijk}$ of cell $C(ijk)$ containing all cells $C(\alpha\beta\gamma)$.]


Figure  2.  Each  cell  in  this  3-dimensional  CNN  is  a  Chua’s  oscillator  with  3  external  terminals, 
or  2  ports,  with  a  common  ground.  Each  external  terminal  (except  the  ground),  or  port,  of  each 
cell  is  connected  to  a  capacitor  inside  the  cell,  and  coupled  to  a  corresponding  terminal  of  the  6 
neighboring  cells  via  6  linear  passive  resistors. 


If all elements inside the cell shown in the inset are passive, and if the nonlinear
resistor is not locally active (e.g., a pn junction diode) in the sense of Definition 1
in Sec. 4.2, then it can be shown that, given any initial voltage distribution, all
node-to-datum voltages must tend to zero as $t \to \infty$. In other words, this CNN
must have a homogeneous (uniform) solution at all nodes. This homogeneity is
generally expected since all cells in Fig. 2, and their couplings, as well as boundary
conditions, are identical, and there are no input sources.^c

^c We assume throughout this paper that the CNN has no input sources and has a symmetrical
boundary condition (e.g., a zero-flux or Neumann boundary condition), since the central concept
of emergence implies that any non-homogeneous output pattern must emerge via self-organization,
and not through any external input or non-symmetrical boundary conditions.


Figure 3. Three-dimensional knot voltage distribution patterns from a CNN cube made of Chua's
oscillator cells, coupled via a linear passive resistive grid. (a) A Hopf-link pattern. (b) A 3-knot
pattern. (c) A simple helix. (d) A double helix.


However, if the nonlinear resistor is chosen to be a Chua's diode [23], as shown
in the inset in Fig. 2, then for the fixed choice of circuit parameters given in [22],
the 4 distinctly different 3-dimensional structures in Fig. 3 can be obtained in steady
state by choosing the 4 different sets of initial conditions given in [22]. Note that
this CNN is symmetrical with respect to the center of the CNN cube in Fig. 2 and there
are no inputs. Yet we have a non-uniform constant steady-state node-to-datum
voltage distribution! In the parlance of complexity theory, we say the homogeneous
CNN undergoes a symmetry breaking, and any non-homogeneous node-to-datum
steady-state voltage distribution is called a pattern, or a dissipative structure if
the medium is non-conservative in the sense that energy dissipation is essential to
maintain the structure.
Figure 4. (a) A straight scroll vortex. (b) A twisted scroll wave.


Example  2.  Emergence  of  3-Dimensional  Scroll  Waves 

Consider the same CNN cube as in Fig. 2 except that in this case, only the port
voltage across the nonlinear resistor (Chua's diode) is coupled to corresponding
node voltages of the 6 neighboring cells. This is equivalent to open-circuiting the
second port (on the right) of the Chua's oscillator in Fig. 2, so that each cell reduces
to a one-port circuit with 2 external terminals (including the ground reference
terminal). Using the circuit parameters, and the initial and boundary conditions,
given in [22], we obtain the two scroll wave structures shown in Figs. 4(a) and 4(b),
respectively, for one instant of time. Unlike the stationary structures shown in
Fig. 3, the scroll waves in Fig. 4 represent an active nonlinear wave which evolves
continuously with a scrolling structure at all times.

Example  3.  Emergence  of  2-Dimensional  Spiral  Waves 

If we consider a 2-dimensional version of the preceding CNN cube, we would
obtain the simplified 2-dimensional CNN shown in Fig. 5, where each "Chua's
oscillator" one-port is represented by a 2-terminal black box, coupled to its
neighbors by 4 linear positive resistances, two along each coordinate axis. Using the
circuit parameters and the initial and boundary conditions given in [22], we obtain
the spiral wave structure shown in Fig. 6. Again, this is a dynamic pattern which
rotates continuously for all times.
Figure 5. A two-dimensional CNN made of Chua's oscillator cells coupled via a single layer of
linear resistive grid.


2  Mapping  PDE  Into  CNN 


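A vast majority of active homogeneous media which are known to exhibit
complexity in the form of dissipative structures are modeled by a reaction diffusion
PDE [14-18]:

$$\frac{\partial x_i}{\partial t} = f_i(x_1, x_2, \ldots, x_n) + D_i \nabla^2 x_i, \qquad i = 1, 2, \ldots, n \tag{1}$$

where $\mathbf{x} = (x_1, x_2, \ldots, x_n)$ are state variables, $(x, y, z)$ are spatial
coordinates, $\mathbf{f}(\mathbf{x}) = (f_1(\mathbf{x}), f_2(\mathbf{x}), \ldots, f_n(\mathbf{x}))$
is a nonlinear vector function of $\mathbf{x}$ called the kinetic term, and
$D_1, D_2, \ldots, D_n$ are constants called diffusion coefficients. Replacing the
Laplacian in Eq.(1) by its discretized version

$$\nabla^2 x_i = \frac{\partial^2 x_i}{\partial x^2} + \frac{\partial^2 x_i}{\partial y^2} + \frac{\partial^2 x_i}{\partial z^2} \;\longrightarrow\; \nabla^2(\mathbf{x}_{\alpha,\beta,\gamma})_i \tag{2}$$

where

$$\begin{aligned}
\nabla^2(\mathbf{x}_{\alpha,\beta,\gamma})_i ={}& x_i(\alpha+1,\beta,\gamma) + x_i(\alpha-1,\beta,\gamma) + x_i(\alpha,\beta+1,\gamma) + x_i(\alpha,\beta-1,\gamma) \\
&+ x_i(\alpha,\beta,\gamma+1) + x_i(\alpha,\beta,\gamma-1) - 6\,x_i(\alpha,\beta,\gamma), \qquad i = 1, 2, \ldots, n
\end{aligned} \tag{3}$$

is the discrete Laplacian operator on the ith component $x_i$ of the state variable
$\mathbf{x} = (x_1, x_2, \ldots, x_n)^T$ about the grid point with spatial coordinate
$(\alpha, \beta, \gamma)$, we obtain the following associated Reaction Diffusion CNN equation

$$\dot{x}_i(\alpha,\beta,\gamma) = f_i\big(x_1(\alpha,\beta,\gamma), x_2(\alpha,\beta,\gamma), \ldots, x_n(\alpha,\beta,\gamma)\big) + D_i \nabla^2(\mathbf{x}_{\alpha,\beta,\gamma})_i \tag{4}$$

where $i = 1, 2, \ldots, n$; $\alpha = 1, 2, \ldots, N_\alpha$; $\beta = 1, 2, \ldots, N_\beta$;
and $\gamma = 1, 2, \ldots, N_\gamma$.

Here $x_i(\alpha, \beta, \gamma)$ denotes the state variable $x_i$ located at a point in the
3-dimensional space with spatial coordinate $(\alpha, \beta, \gamma)$. Observe that the
Reaction Diffusion CNN equation (4) consists of a system of $N = n N_\alpha N_\beta N_\gamma$
ordinary differential equations (ODE's).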
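The discretization above can be sketched in NumPy. This is a minimal illustration rather than the implementation used in the cited work: the function names `discrete_laplacian` and `cnn_rhs` are hypothetical, the grid spacing is taken as 1 as in Eq. (3), and zero-flux boundaries are assumed and imposed by replicating the edge values.

```python
import numpy as np


def discrete_laplacian(x):
    """Seven-point discrete Laplacian of Eq. (3) on a 3-D array.

    Assumption: zero-flux boundaries, implemented by replicating edge values.
    """
    p = np.pad(x, 1, mode="edge")
    return (p[2:, 1:-1, 1:-1] + p[:-2, 1:-1, 1:-1]
            + p[1:-1, 2:, 1:-1] + p[1:-1, :-2, 1:-1]
            + p[1:-1, 1:-1, 2:] + p[1:-1, 1:-1, :-2]
            - 6.0 * x)


def cnn_rhs(x, f, D):
    """Right-hand side of Eq. (4).

    x has shape (n, N_alpha, N_beta, N_gamma); f maps x to the kinetic term
    with the same shape; D is a sequence of the n diffusion coefficients.
    """
    diffusion = np.stack([D[i] * discrete_laplacian(x[i]) for i in range(len(D))])
    return f(x) + diffusion
```

The array `x` holds one number per state variable per grid point, so its size is the number of ODE's, n N_alpha N_beta N_gamma, quoted in the text.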

We  will  henceforth  refer  to  the  process  of  transforming  a  PDE  into  a  CNN 
equation  as  mapping  a  PDE  into  a  CNN.  Table  1  shows  the  mapping  of  4 
well-known  reaction  diffusion  PDE’s. 


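Table 1. Mapping Reaction-Diffusion PDE into a Reaction-Diffusion CNN.

FitzHugh-Nagumo PDE:

$$\frac{\partial u}{\partial t} = -\left(\frac{u^3}{3} - u\right) - v + D_1 \frac{\partial^2 u}{\partial x^2}, \qquad
\frac{\partial v}{\partial t} = -\varepsilon\,[u - bv + a]$$

FitzHugh-Nagumo CNN Equation:

$$\dot{u}_i = -\left(\frac{u_i^3}{3} - u_i\right) - v_i + D_1\,[u_{i+1} + u_{i-1} - 2u_i], \qquad
\dot{v}_i = -\varepsilon\,[u_i - b v_i + a]$$

Brusselator PDE:

$$\frac{\partial u}{\partial t} = a - (b+1)u + u^2 v + D_1\left[\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2}\right], \qquad
\frac{\partial v}{\partial t} = bu - u^2 v + D_2\left[\frac{\partial^2 v}{\partial x^2} + \frac{\partial^2 v}{\partial y^2}\right]$$

Brusselator CNN Equation:

$$\dot{u}_{ij} = a - (b+1)u_{ij} + u_{ij}^2 v_{ij} + D_1\,[u_{i+1,j} + u_{i-1,j} + u_{i,j+1} + u_{i,j-1} - 4u_{ij}]$$
$$\dot{v}_{ij} = b u_{ij} - u_{ij}^2 v_{ij} + D_2\,[v_{i+1,j} + v_{i-1,j} + v_{i,j+1} + v_{i,j-1} - 4v_{ij}]$$

Meinhardt-Gierer PDE:

$$\frac{\partial u}{\partial t} = \frac{\alpha u^2}{v} - \mu u + D_1\left[\frac{\partial^2 u}{\partial x^2} + \frac{\partial^2 u}{\partial y^2}\right], \qquad
\frac{\partial v}{\partial t} = \alpha u^2 - \gamma v + D_2\left[\frac{\partial^2 v}{\partial x^2} + \frac{\partial^2 v}{\partial y^2}\right]$$

Meinhardt-Gierer CNN Equation:

$$\dot{u}_{ij} = \frac{\alpha u_{ij}^2}{v_{ij}} - \mu u_{ij} + D_1\,[u_{i+1,j} + u_{i-1,j} + u_{i,j+1} + u_{i,j-1} - 4u_{ij}]$$
$$\dot{v}_{ij} = \alpha u_{ij}^2 - \gamma v_{ij} + D_2\,[v_{i+1,j} + v_{i-1,j} + v_{i,j+1} + v_{i,j-1} - 4v_{ij}]$$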
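As an illustration of how an entry of Table 1 turns into a simulation, the following sketch integrates the one-dimensional FitzHugh-Nagumo CNN equation with forward Euler. Everything numerical here is an assumption made for the example: the parameter values, the step size, the zero-flux ends, and the function names. With the sign convention of Table 1, a negative value of eps reproduces the usual FitzHugh-Nagumo recovery equation, which is why eps = -0.08 is used.

```python
import numpy as np


def fhn_cnn_rhs(u, v, a, b, eps, D1):
    """Right-hand side of the FitzHugh-Nagumo CNN equation on a 1-D array."""
    up = np.pad(u, 1, mode="edge")            # zero-flux ends (assumption)
    lap = up[2:] + up[:-2] - 2.0 * u          # u[i+1] + u[i-1] - 2 u[i]
    du = -(u**3 / 3.0 - u) - v + D1 * lap
    dv = -eps * (u - b * v + a)
    return du, dv


def simulate(n_cells=50, steps=2000, dt=0.01, a=0.7, b=0.8, eps=-0.08, D1=1.0):
    """Forward-Euler integration starting from a localized perturbation."""
    u = np.zeros(n_cells)
    v = np.zeros(n_cells)
    u[0] = 1.0                                # perturb the first cell only
    for _ in range(steps):
        du, dv = fhn_cnn_rhs(u, v, a, b, eps, D1)
        u, v = u + dt * du, v + dt * dv
    return u, v


u_final, v_final = simulate()
```

Forward Euler is chosen only for brevity; any standard ODE integrator could be substituted, since the CNN equation is an ordinary system of 2 x n_cells coupled ODE's.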

Figure  6.  Three-dimensional  view  of  a  steadily  rotating  spiral  wave  at  one  instant  of  time.  The 
vertical  axis  (perpendicular  to  the  plane)  represents  the  capacitor  voltage  across  Chua’s  diode  and 
the  horizontal  axes  (on  the  plane)  are  the  spatial  coordinates. 
Reaction diffusion PDE's form an important, albeit relatively small, subset of
the universe of all nonlinear PDE's. For any other nonlinear PDE, the spatial
derivative (of any order) of any state variable $u_i(t; x, y, z)$ at time $t$ and spatial
location^d $(x, y, z) \in \mathbb{R}^3$ can be approximated to any desired accuracy [24,25] by
finite differences involving only the values of the state variable
$u_i(t; \alpha', \beta', \gamma')$ located at a finite number of lattice points
$(x, y, z) = (\alpha', \beta', \gamma') \in S_{\alpha\beta\gamma}$, where
$(\alpha, \beta, \gamma) \in \mathbb{Z}^3$ denotes the "discrete" coordinates of a
3-dimensional lattice $\mathbb{Z}^3$, and $S_{\alpha\beta\gamma}(r)$ denotes a neighborhood
of radius $r$, centered at $(\alpha, \beta, \gamma)$. Hence, given
any nonlinear PDE, we can generate many (depending on the desired accuracy)
approximate discretized systems of ODE's in terms of a 3-dimensional array of
state variables $x_i(\alpha, \beta, \gamma)$, $\alpha = 1, 2, \ldots, N_\alpha$,
$\beta = 1, 2, \ldots, N_\beta$, $\gamma = 1, 2, \ldots, N_\gamma$. Moreover,
since all finite-difference operations involve only variables located within a
local neighborhood, we can always decompose^e the discretized system into a
component (the isolated cell) which involves only $u_i(t; \alpha, \beta, \gamma)$ at the
lattice site $(\alpha, \beta, \gamma)$, and another component (the cell coupling) which
involves all neighboring cells $(\alpha', \beta', \gamma') \in S_{\alpha\beta\gamma}(r)$.
In other words, given any nonlinear PDE, we can induce many associated CNN equations;
the examples given in Table 1 represent the simplest examples. Although it is not true
that the qualitative behaviors of a nonlinear PDE and its associated CNN equation are
always the same (the propagation failure phenomenon [26] is a case in point), extensive
computer experiments have shown that for the vast majority of cases, the respective
solutions can be made virtually indistinguishable by choosing a sufficiently large
array size and by optimizing the CNN cell and coupling parameters [22].
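The trade-off between accuracy and neighborhood radius can be illustrated with two standard central-difference stencils for a second derivative. The sketch below is an assumption-laden example, not taken from the cited references: the test function sin(x), the step sizes, and the name `second_derivative_error` are all chosen for illustration. The 3-point stencil uses a neighborhood of radius 1, while the 5-point stencil uses radius 2 and is more accurate for the same step.

```python
import numpy as np


def second_derivative_error(h, order):
    """Max error of a central-difference estimate of d2/dx2 sin(x) on [0, 2*pi)."""
    x = np.arange(0.0, 2.0 * np.pi, h)
    u = np.sin
    if order == 2:   # 3-point stencil, neighborhood radius r = 1
        approx = (u(x + h) - 2.0 * u(x) + u(x - h)) / h**2
    else:            # 5-point stencil, neighborhood radius r = 2
        approx = (-u(x + 2 * h) + 16.0 * u(x + h) - 30.0 * u(x)
                  + 16.0 * u(x - h) - u(x - 2 * h)) / (12.0 * h**2)
    return np.max(np.abs(approx + np.sin(x)))   # exact second derivative is -sin(x)


err_r1_coarse = second_derivative_error(0.1, order=2)
err_r1_fine = second_derivative_error(0.05, order=2)
err_r2_coarse = second_derivative_error(0.1, order=4)
```

Halving the step roughly quarters the error of the radius-1 stencil, while enlarging the neighborhood to radius 2 reduces the error at a fixed step; both routes lead to a different associated CNN equation for the same PDE.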

It is important to observe that partial differential equations are merely
mathematical abstractions of nature. The concept of a continuum is in fact an idealization
of reality. Even the collection of all electrons in a solid does not form a continuum
because much of the volume separating the electrons from the nucleus of atoms^f
represents a vast empty space! In fact, recent works by Smolin and his colleagues
have proved that "the spectrum of the volume of any physical region is discrete" [27].
In particular, quantum mechanics implies that at extremely small distances even
space is made of discrete bits!

3  Local  Activity  Is  the  Origin  of  Complexity 

Let $\mathcal{N}$ be a 2-dimensional CNN associated with a homogeneous medium. By
definition, a CNN is defined uniquely by cells and their interactions. Let us identify each
cell $C(j, k)$ as a nonlinear m-port defined by its cell dynamics, where $m$ is equal to
the number of state variables which are directly coupled to its neighbors, as depicted
by the $m$ external (ungrounded) terminals attached to each cell $C(j, k)$ in Fig. 7
for a reaction diffusion CNN. Note that the cell $C(j, k)$ may contain additional state
variables which are not directly coupled to its neighbors and hence are suppressed
in Fig. 7 to avoid clutter.^g

^d To avoid clutter we restrict our discussion to the 3-dimensional Euclidean space
$\mathbb{R}^3$. The same formulation is valid for any dimension.

^e In most cases, the decomposition consists of just the superposition of these two components.
However, more complex decompositions (e.g., nonlinear functional compositions) may be required
for some nonlinear PDE's.

^f The ratio of the diameter of the electron orbits in an atom to the diameter of its nucleus
is of the order of $10^5$.

Let us identify next the couplings from cell $C(j, k)$ to all cells within the 3 x 3
sphere of influence $S_{jk}$ centered at $(j, k)$ by a $\gamma$-port $\Gamma_{jk}$. For the
one-diffusion reaction diffusion CNN shown in Fig. 5, an isolated cell $C(j, k)$ and its 4
coupling resistors are shown in Fig. 8(a). Note that the 5-terminal coupling circuit, or
$\gamma$-port $\Gamma_{jk}$ ($\gamma = 4$ in this case), in Fig. 8(a) can be redrawn into the
form of a grounded 4-port (i.e., with a common ground terminal) in Fig. 8(b). For the
reaction-diffusion CNN depicted in Fig. 7, the $\gamma$-port $\Gamma_{jk}$ is composed of $m$
identical grounded 4-ports (with node $(j, k)$ as the ground node) made of 4 identical positive
resistances, so that $\gamma = 4m$. Observe that a CNN is completely specified by the m-port
cell $C(j, k)$ and the $\gamma$-port coupling $\Gamma_{jk}$ since they can be used as a
template to build up a CNN of any array size.

Since the conductance of all resistors in each layer "i" of the resistive grid in
Fig. 7 is equal to the diffusion coefficient $D_i$, which is assumed to be positive, it
follows that the $\gamma$-port coupling $\Gamma_{jk}$ is passive. If the cell $C(j, k)$ is
not locally active, then it follows from symmetry considerations and the qualitative theory
of nonlinear networks [28,29,30,31] that the CNN must have a unique steady state solution,
thereby implying that all nodes belonging to the same layer must have identical
node-to-datum voltages. It follows that no patterns or dissipative structures can
exist.
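The conclusion that passive cells with passive couplings cannot sustain a pattern can be observed numerically in the simplest possible case. In the sketch below each cell is a positive linear conductance g (a passive one-port, chosen here purely for illustration) and the coupling is a resistive grid with positive conductance D; the names, parameter values, grid size, and random initial condition are all assumptions of the example.

```python
import numpy as np


def relax(v0, D=1.0, g=1.0, dt=0.01, steps=5000):
    """Integrate dv/dt = -g*v + D*laplacian(v) on a 2-D grid with zero-flux edges.

    The cell (conductance g > 0) and the coupling (conductance D > 0) are both
    passive; the values are illustrative only.
    """
    v = v0.copy()
    for _ in range(steps):
        p = np.pad(v, 1, mode="edge")
        lap = p[2:, 1:-1] + p[:-2, 1:-1] + p[1:-1, 2:] + p[1:-1, :-2] - 4.0 * v
        v = v + dt * (-g * v + D * lap)
    return v


rng = np.random.default_rng(0)
v_final = relax(rng.uniform(-1.0, 1.0, size=(16, 16)))
spread = v_final.max() - v_final.min()   # zero spread means a homogeneous state
```

Starting from a random voltage distribution, the spread across the grid collapses towards zero, which is the homogeneous outcome described in the text. This linear example only illustrates the statement; it does not replace the general argument from nonlinear network theory.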

In the general case, the $\gamma$-port coupling $\Gamma_{jk}$ may consist of a nonlinear
dynamical multi-port. In this case, the dynamics must again tend to a homogeneous
node voltage distribution on each resistor grid if both the m-port cell $C(j, k)$ and
the $\gamma$-port coupling $\Gamma_{jk}$ are not locally active. The above analysis
justifies the following fundamental result on complexity:

The  Local  Activity  Principle 

A CNN associated with a homogeneous non-conservative (i.e., not lossless) medium
having a zero-flux boundary condition cannot exhibit patterns or dissipative
structures unless the cells, or the couplings, are locally active.


4  Local  Activity  for  Reaction-Diffusion  CNNs 

In  general,  each  cell  C(j,  k,  l)  in  a  reaction-diffusion  CNN  has  n  state  variables  but 
only  m  <  n  among  them  are  coupled  directly  to  its  nearest  neighbors  via  diffusion. 


^g A state variable $x_k$ is suppressed if its associated diffusion coefficient $D_k$ is zero.


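In this case, the state equation of each cell $C(j, k, l)$ assumes the form

$$\begin{aligned}
\dot{V}_1(j,k,l) &= f_1\big(V_1(j,k,l), V_2(j,k,l), \ldots, V_n(j,k,l)\big) + D_1 \nabla^2 V_1(j,k,l) \\
\dot{V}_2(j,k,l) &= f_2\big(V_1(j,k,l), V_2(j,k,l), \ldots, V_n(j,k,l)\big) + D_2 \nabla^2 V_2(j,k,l) \\
&\;\;\vdots \\
\dot{V}_m(j,k,l) &= f_m\big(V_1(j,k,l), V_2(j,k,l), \ldots, V_n(j,k,l)\big) + D_m \nabla^2 V_m(j,k,l) \\
\dot{V}_{m+1}(j,k,l) &= f_{m+1}\big(V_1(j,k,l), V_2(j,k,l), \ldots, V_n(j,k,l)\big) \\
&\;\;\vdots \\
\dot{V}_n(j,k,l) &= f_n\big(V_1(j,k,l), V_2(j,k,l), \ldots, V_n(j,k,l)\big)
\end{aligned} \tag{5}$$

$j = 1, 2, \ldots, N_x$; $k = 1, 2, \ldots, N_y$; $l = 1, 2, \ldots, N_z$.

Alternatively, we can include the remaining Laplacian terms $D_{m+1}\nabla^2 V_{m+1}(j,k,l)$,
$D_{m+2}\nabla^2 V_{m+2}(j,k,l)$, ..., $D_n \nabla^2 V_n(j,k,l)$ in Eq.(5) and set
$D_{m+1} = D_{m+2} = \ldots = D_n = 0$. Recasting Eq.(5) into vector form, we obtain

$$\dot{\mathbf{V}}_a = \mathbf{f}_a(\mathbf{V}_a, \mathbf{V}_b) + \mathbf{D}_a \nabla^2 \mathbf{V}_a \tag{6}$$

$$\dot{\mathbf{V}}_b = \mathbf{f}_b(\mathbf{V}_a, \mathbf{V}_b) \tag{7}$$

where

$$\mathbf{V}_a = [V_1(j,k,l), V_2(j,k,l), \ldots, V_m(j,k,l)]^T \tag{8}$$

$$\mathbf{V}_b = [V_{m+1}(j,k,l), V_{m+2}(j,k,l), \ldots, V_n(j,k,l)]^T \tag{9}$$

$$\mathbf{f}_a = [f_1(\cdot), f_2(\cdot), \ldots, f_m(\cdot)]^T \tag{10}$$

$$\mathbf{f}_b = [f_{m+1}(\cdot), f_{m+2}(\cdot), \ldots, f_n(\cdot)]^T \tag{11}$$

$$\nabla^2 \mathbf{V}_a = [\nabla^2 V_1(j,k,l), \nabla^2 V_2(j,k,l), \ldots, \nabla^2 V_m(j,k,l)]^T \tag{12}$$

$$\mathbf{D}_a = \begin{bmatrix} D_1 & & & \\ & D_2 & & \\ & & \ddots & \\ & & & D_m \end{bmatrix}, \qquad D_i > 0 \tag{13}$$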


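To emphasize that a CNN is defined by specifying the dynamics of the cells and
their couplings (cell interaction laws), let us rewrite Eqs.(6)-(7) into the following
standard form [32]:

cell dynamics:

$$\dot{\mathbf{V}}_a = \mathbf{f}_a(\mathbf{V}_a, \mathbf{V}_b) + \mathbf{I}_a \tag{14}$$

$$\dot{\mathbf{V}}_b = \mathbf{f}_b(\mathbf{V}_a, \mathbf{V}_b) \tag{15}$$

cell interaction laws:

$$\begin{aligned}
\mathbf{I}_a &= \mathbf{g}_a\big(\mathbf{V}_a(j,k,l), \mathbf{V}_a(j-1,k,l), \mathbf{V}_a(j+1,k,l), \mathbf{V}_a(j,k-1,l), \mathbf{V}_a(j,k+1,l), \mathbf{V}_a(j,k,l-1), \mathbf{V}_a(j,k,l+1)\big) \\
&= \begin{bmatrix} D_1 & & & \\ & D_2 & & \\ & & \ddots & \\ & & & D_m \end{bmatrix}
\begin{bmatrix} \nabla^2 V_1(j,k,l) \\ \nabla^2 V_2(j,k,l) \\ \vdots \\ \nabla^2 V_m(j,k,l) \end{bmatrix}
\end{aligned} \tag{16}$$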

It is important to observe that Eqs.(14)-(15) defining the cell dynamics involve only
the voltage variables $(\mathbf{V}_a(j,k,l), \mathbf{V}_b(j,k,l))$ and the current variables
$\mathbf{I}_a(j,k,l)$ at the same spatial location $(j,k,l)$,^h whereas Eq.(16) defining the
cell interaction laws involves not only the voltage variable $\mathbf{V}_a(j,k,l)$ at spatial
location $(j,k,l)$, but also those of the neighboring cells. The relationship between a
typical cell $C(j,k,l)$ at location $(j,k,l)$ and its coupling network is shown in Fig. 7 for
a 2-dimensional CNN reaction diffusion equation, where
$\mathbf{V}_a = [V_1(j,k), V_2(j,k), \ldots, V_m(j,k)]^T$ and
$\mathbf{I}_a = [I_1(j,k), I_2(j,k), \ldots, I_m(j,k)]^T$ are the "m" port voltage and port
current variables. In the special case when $m = 1$, Fig. 7 reduces to Fig. 8(a), where cell
$C(j,k)$ degenerates into a one-port.

4.1  Cell  Equilibrium  Points 

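Let us define the static characteristic of an isolated CNN cell by setting
$\dot{\mathbf{V}}_a = 0$ and $\dot{\mathbf{V}}_b = 0$ in Eqs.(14) and (15); namely,

$$\mathbf{f}_a(\mathbf{V}_a, \mathbf{V}_b) + \mathbf{I}_a = 0 \tag{17}$$

$$\mathbf{f}_b(\mathbf{V}_a, \mathbf{V}_b) = 0 \tag{18}$$

where $\mathbf{V}_a \in \mathbb{R}^m$, $\mathbf{I}_a \in \mathbb{R}^m$,
$\mathbf{V}_b \in \mathbb{R}^{n-m}$, $\mathbf{f}_a \in \mathbb{R}^m$,
$\mathbf{f}_b \in \mathbb{R}^{n-m}$. Solving Eq.(18) for $\mathbf{V}_b$ in terms of
$\mathbf{V}_a$, we obtain

$$\mathbf{V}_b = \mathbf{g}_a(\mathbf{V}_a) \tag{19}$$

where $\mathbf{g}_a(\cdot)$ may be a multi-valued function of $\mathbf{V}_a$.

Substituting Eq.(19) for $\mathbf{V}_b$ in Eq.(17), we obtain the following implicit and
possibly multi-valued function:

Static Cell Characteristic:

$$\mathbf{e}(\mathbf{V}_a, \mathbf{I}_a) = \mathbf{f}_a(\mathbf{V}_a, \mathbf{g}_a(\mathbf{V}_a)) + \mathbf{I}_a = 0 \tag{20}$$

^h This observation allows us to suppress the spatial coordinates in Eqs.(14)-(15) without ambiguity.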
Let us solve Eq.(20) for $\mathbf{V}_a$. In general, there can be many solutions
$\mathbf{V}_a = \mathbf{V}_a(Q_1), \mathbf{V}_a(Q_2), \ldots, \mathbf{V}_a(Q_p)$ for each
$\mathbf{I}_a = \bar{\mathbf{I}}_a \in \mathbb{R}^m$, where $\mathbf{V}_a(Q_i)$ denotes the
port voltage solution $\mathbf{V}_a$ at the ith solution $Q_i$. We call each solution
$\mathbf{V}_a = \mathbf{V}_a(Q_i)$ a cell equilibrium point associated with
$\mathbf{I}_a = \bar{\mathbf{I}}_a \in \mathbb{R}^m$. In other words, each cell
equilibrium point $\mathbf{V}_a(Q_i)$ is parameterized by $\mathbf{I}_a \in \mathbb{R}^m$.
The locus of all such cell equilibrium points calculated explicitly at
$\mathbf{I}_a = \bar{\mathbf{I}}_a$, as $\bar{\mathbf{I}}_a$ ranges over the entire
m-dimensional Euclidean space $\mathbb{R}^m$, is identical to the static cell
characteristic defined implicitly in Eq.(20).
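For a concrete cell, the equilibrium points can be computed directly. The sketch below uses the FitzHugh-Nagumo cell of Table 1, taking u as the coupled variable (so m = 1) with the port current added to the first equation as in Eq. (14); the function name `fhn_equilibria`, the requirement b != 0, and the parameter values are assumptions of the example. Setting dv/dt = 0 gives v = (u + a)/b, and substituting into the first equation leaves a cubic in u.

```python
import numpy as np


def fhn_equilibria(I_a, a, b):
    """Real solutions u of -(u**3/3 - u) - (u + a)/b + I_a = 0 (requires b != 0)."""
    # dv/dt = 0 gives v = (u + a) / b, which plays the role of Eq. (19).
    coeffs = [-1.0 / 3.0, 0.0, 1.0 - 1.0 / b, I_a - a / b]
    roots = np.roots(coeffs)
    return sorted(float(r.real) for r in roots if abs(r.imag) < 1e-9)


three_points = fhn_equilibria(I_a=0.0, a=0.0, b=2.0)   # three equilibria
one_point = fhn_equilibria(I_a=0.0, a=0.7, b=0.8)      # a single equilibrium
```

Sweeping `I_a` over a range and collecting the returned values traces out the static cell characteristic for this cell; the number of equilibrium points can change along the way, which is the multi-valued behavior mentioned in the text.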


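4.2 Cell Complexity Matrix

Let $Q_i$ be a cell equilibrium point associated with $\mathbf{I}_a = \mathbf{I}_a(Q_i)$.
Let $\mathbf{V}_a(Q_i)$ (obtained from Eq.(20)) be the corresponding cell equilibrium point
of the cell state equations Eqs.(14)-(15). Let

$$\mathbf{J}(Q_i) \triangleq \begin{bmatrix} \mathbf{A}_{aa}(Q_i) & \mathbf{A}_{ab}(Q_i) \\ \mathbf{A}_{ba}(Q_i) & \mathbf{A}_{bb}(Q_i) \end{bmatrix} \tag{21}$$

denote the $n \times n$ Jacobian matrix associated with
$\mathbf{f}_a(\mathbf{V}_a, \mathbf{V}_b)$ and $\mathbf{f}_b(\mathbf{V}_a, \mathbf{V}_b)$,
evaluated at $(\mathbf{V}_a(Q_i), \mathbf{V}_b(Q_i))$, where
$\mathbf{V}_b(Q_i) = \mathbf{g}_a(\mathbf{V}_a(Q_i))$ from Eq.(19), namely,

Small Signal CNN Cell Coefficients at $Q_i$:

$$\mathbf{A}_{aa}(Q_i) = \left.\frac{\partial \mathbf{f}_a}{\partial \mathbf{V}_a}\right|_{Q_i}, \quad
\mathbf{A}_{ab}(Q_i) = \left.\frac{\partial \mathbf{f}_a}{\partial \mathbf{V}_b}\right|_{Q_i}, \quad
\mathbf{A}_{ba}(Q_i) = \left.\frac{\partial \mathbf{f}_b}{\partial \mathbf{V}_a}\right|_{Q_i}, \quad
\mathbf{A}_{bb}(Q_i) = \left.\frac{\partial \mathbf{f}_b}{\partial \mathbf{V}_b}\right|_{Q_i} \tag{22}$$

Let $N(Q_i)$ be the linearized CNN cell at $Q_i$ associated with the state Equations
(14)-(15) of the m-port cell $C(j,k,l)$ obtained by deleting the higher order terms
in the Taylor series expansion of $\mathbf{f}_a(\mathbf{V}_a, \mathbf{V}_b)$ and
$\mathbf{f}_b(\mathbf{V}_a, \mathbf{V}_b)$ about $Q_i$; namely,

Linearized CNN Cell dynamics at $Q_i$:

$$\dot{\mathbf{v}}_a = \mathbf{A}_{aa}\mathbf{v}_a + \mathbf{A}_{ab}\mathbf{v}_b + \mathbf{i}_a \tag{23}$$

$$\dot{\mathbf{v}}_b = \mathbf{A}_{ba}\mathbf{v}_a + \mathbf{A}_{bb}\mathbf{v}_b \tag{24}$$

where $\mathbf{v}_a = \mathbf{V}_a - \mathbf{V}_a(Q_i)$,
$\mathbf{v}_b = \mathbf{V}_b - \mathbf{V}_b(Q_i)$, and
$\mathbf{i}_a = \mathbf{I}_a - \mathbf{I}_a(Q_i)$ are the small-signal deviations of the
voltages and currents, respectively, about the equilibrium point $Q_i$.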
Definition 1: Local Activity

A CNN cell $C(j,k,l)$ is said to be locally active at $Q_i$ iff there exists some $i_a(t)$ and some time $T > 0$ such that

$$\delta\mathcal{E}(Q_i) \triangleq \int_0^T \langle v_a(t), i_a(t) \rangle \, dt < 0 \tag{25}$$

where $\langle \cdot, \cdot \rangle$ denotes the vector dot product, and $v_a(t)$ is the solution obtained by solving Eqs.(23)-(24) under zero initial state $v_a(0) = 0$ and $v_b(0) = 0$.

To derive a test for local activity, let us take the Laplace transform of Eqs.(23)-(24) to obtain

$$s\hat{v}_a(s) = A_{aa}\hat{v}_a(s) + A_{ab}\hat{v}_b(s) + \hat{i}_a(s) \tag{26}$$

$$s\hat{v}_b(s) = A_{ba}\hat{v}_a(s) + A_{bb}\hat{v}_b(s) \tag{27}$$

where $\hat{v}_a(s)$, $\hat{v}_b(s)$ and $\hat{i}_a(s)$ denote the Laplace transforms of $v_a(t)$, $v_b(t)$, and $i_a(t)$, respectively. Solving for $\hat{v}_b(s)$ from Eq.(27), we obtain

$$\hat{v}_b(s) = (sI - A_{bb})^{-1} A_{ba}\hat{v}_a(s) \tag{28}$$

Substituting Eq.(28) for $\hat{v}_b(s)$ in Eq.(26) and solving for $\hat{i}_a(s)$, we obtain

$$\hat{i}_a(s) = Y_Q(s)\hat{v}_a(s) \tag{29}$$

where

CNN cell complexity matrix at $Q_i$:

$$Y_Q(s) \triangleq (sI - A_{aa}) - A_{ab}(sI - A_{bb})^{-1}A_{ba} \tag{30}$$

is called the CNN cell complexity matrix at the cell equilibrium point $Q_i$. It follows from a classic theorem in circuit theory 33 that for a reaction diffusion CNN cell to be locally active at $Q_i$, $Y_Q(s)$ should not be a positive real matrix at $Q_i$. Hence, in order for a reaction diffusion CNN equation to exhibit complexity, it is necessary that the cell parameters be chosen such that the cell complexity matrix $Y_Q(s)$ is not positive real at $Q_i$. The mathematical conditions for testing this local activity property are as follows:
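
The elimination of the internal state can be sketched numerically. Solving the transformed second state equation for the internal voltages and substituting into the first gives Y_Q(s) = (sI - A_aa) - A_ab (sI - A_bb)^{-1} A_ba, where the four blocks partition the Jacobian at the equilibrium point. The function name and the example Jacobian below are assumptions made for illustration only:

```python
import numpy as np

def cell_complexity_matrix(J, m, s):
    """Evaluate Y_Q(s) = (sI - A_aa) - A_ab (sI - A_bb)^{-1} A_ba.

    J : (n, n) Jacobian at the equilibrium point, ordered so that the
        first m states are the port (diffusing) variables.
    m : number of ports (non-zero diffusion coefficients).
    s : complex frequency; it must not be an eigenvalue of A_bb.
    """
    J = np.asarray(J, dtype=complex)
    n = J.shape[0]
    A_aa, A_ab = J[:m, :m], J[:m, m:]
    A_ba, A_bb = J[m:, :m], J[m:, m:]
    Y = s * np.eye(m) - A_aa
    if n > m:
        # Solve (sI - A_bb) X = A_ba instead of forming the inverse.
        Y = Y - A_ab @ np.linalg.solve(s * np.eye(n - m) - A_bb, A_ba)
    return Y

if __name__ == "__main__":
    J = [[-1.0, 2.0], [-3.0, 0.5]]   # hypothetical small-signal coefficients
    print(cell_complexity_matrix(J, 1, 0.3 + 1.7j))
```

For m = 1 and n = 2 the result is the scalar (s^2 - Ts + Δ)/(s - a22), with T and Δ the trace and determinant of J, and for m = n the internal block is empty so that Y_Q(s) = sI - J.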
Local activity criteria for reaction diffusion CNN

A Reaction Diffusion CNN cell is locally active at a cell equilibrium point $Q_i$ if, and only if, its cell complexity matrix $Y_Q(s)$, or its inverse cell complexity matrix $Z_Q(s) \triangleq Y_Q^{-1}(s)$ (in this case, simply change the symbol from Y to Z in the following 4 conditions), satisfies any one of the following 4 conditions:

1. $Y_Q(s)$ has a pole in Re[s] > 0.

2. $Y_Q^H(i\omega) \triangleq Y_Q^{\dagger}(i\omega) + Y_Q(i\omega)$ is not a positive semi-definite matrix at some $\omega = \omega_0$, where $\omega_0$ is any real number, and $\dagger$ denotes the Hermitian operator.

3. $Y_Q(s)$ has a simple pole $s = i\omega_p$ on the imaginary axis where its associated residue matrix

$$K_{-1} \triangleq \begin{cases} \lim_{s \to i\omega_p} (s - i\omega_p)\, Y_Q(s), & \text{if } \omega_p < \infty \\ \lim_{\omega_p \to \infty} Y_Q(i\omega_p)/(i\omega_p), & \text{if } \omega_p = \infty \end{cases}$$

is either not a Hermitian matrix, or else not a positive semi-definite Hermitian matrix.

4. $Y_Q(s)$ has a multiple pole on the imaginary axis.
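
Condition 2 lends itself to a direct numerical check. The sketch below evaluates Y_Q(iω) = (iωI - A_aa) - A_ab (iωI - A_bb)^{-1} A_ba on a finite frequency grid and reports the smallest eigenvalue of Y_Q^†(iω) + Y_Q(iω). The function names, the grid, and the tolerance are assumptions; a finite grid can only detect a violation, it cannot prove that none exists, and the other three conditions are not tested here.

```python
import numpy as np

def min_eig_hermitian_part(J, m, omegas):
    """Smallest eigenvalue of Y(iw)^H + Y(iw) over the given frequencies.

    J is the (n, n) Jacobian with the m port variables ordered first.
    Frequencies where iw is an eigenvalue of A_bb must be avoided,
    since Y_Q has a pole there.
    """
    J = np.asarray(J, dtype=complex)
    n = J.shape[0]
    A_aa, A_ab = J[:m, :m], J[:m, m:]
    A_ba, A_bb = J[m:, :m], J[m:, m:]
    worst = np.inf
    for w in omegas:
        s = 1j * w
        Y = s * np.eye(m) - A_aa
        if n > m:
            Y = Y - A_ab @ np.linalg.solve(s * np.eye(n - m) - A_bb, A_ba)
        H = Y + Y.conj().T              # Hermitian by construction
        worst = min(worst, np.linalg.eigvalsh(H).min())
    return worst

def violates_condition_2(J, m, omegas, tol=1e-12):
    """True if the Hermitian part is not positive semi-definite somewhere."""
    return min_eig_hermitian_part(J, m, omegas) < -tol

if __name__ == "__main__":
    grid = np.linspace(-10.0, 10.0, 201)
    print(violates_condition_2([[-1.0, 3.0], [0.0, -1.0]], 2, grid))
```

When m = n the Hermitian part reduces to -(J + J^T), which does not depend on ω, so a single frequency already decides the test in that case.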


4.3  Reaction  Diffusion  CNN:  One  Diffusion  Coefficient 

Many well-known reaction diffusion CNNs have only one non-zero diffusion coefficient and 2 state variables, i.e., m = 1, n = 2. The most famous example belonging to this class is the FitzHugh-Nagumo CNN Equation given in Table 1. For this class of reaction diffusion CNN, all variables in Eqs.(23)-(24) are scalars, and we rewrite them using the established notation 22 for ease of reference:

$$\dot{v}_1 = a_{11}v_1 + a_{12}v_2 + i_1, \qquad \dot{v}_2 = a_{21}v_1 + a_{22}v_2 \tag{31}$$

The inverse CNN cell complexity matrix $Y_Q^{-1}(s) = Z_Q(s) \triangleq \hat{v}_1(s)/\hat{i}_1(s)$ associated with Eq.(31) is a 1 x 1 matrix, or scalar function of s in this case, and is given by

$$Z_Q(s) = \frac{s - a_{22}}{s^2 - Ts + \Delta} \tag{32}$$

where

$$T \triangleq a_{11} + a_{22} \tag{33}$$

$$\Delta \triangleq a_{11}a_{22} - a_{12}a_{21} \tag{34}$$

are the trace and determinant of the associated Jacobian matrix

$$J(Q_i) = \begin{bmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{bmatrix} \tag{35}$$

evaluated at the cell equilibrium point $Q_i$.
Applying the above local activity criteria to this special scalar case (with $Y_Q(s)$ replaced by $Z_Q(s)$), we obtain the following useful corollary:

Local Activity Criteria for One Diffusion Coefficient

A one-port Reaction-Diffusion CNN cell with one diffusion coefficient and two state variables is locally active at a cell equilibrium point $Q = (V_1, I_1)$ if, and only if, any one of the following 4 conditions holds at Q:

1. $a_{11} + a_{22} > 0$ or $a_{11}a_{22} < a_{12}a_{21}$.

2. $a_{11} > 0$, or $a_{11} > a_{12}a_{21}/a_{22}$ if $a_{11} \le 0$ and $a_{22} \ne 0$.

3. $a_{11}a_{22} > a_{12}a_{21}$, $a_{11} + a_{22} = 0$, and $a_{22} \ne 0$.

4. $a_{11}a_{22} = a_{12}a_{21}$, $a_{11} + a_{22} = 0$, and $a_{22} \ne 0$.

(36)


Although the above local activity criteria are couched in terms of the 4 Jacobian coefficients $a_{11}$, $a_{12}$, $a_{21}$ and $a_{22}$, the criteria can be recast into the following 4 equivalent conditions involving only the 3 parameters $\Delta$, $T$, and $a_{22}$.

Equivalent condition 1: $T > 0$, or $\Delta < 0$.

Equivalent condition 2: $T > a_{22}$, or $T \le a_{22}$ and $a_{22}\Delta > 0$.

Equivalent condition 3: $T = 0$, and $\Delta > 0$, and $a_{22} \ne 0$.

Equivalent condition 4: $T = 0$, and $\Delta = 0$, and $a_{22} \ne 0$.

(37)
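
A minimal sketch of these tests in code, assuming the four equivalent conditions read as: (1) T > 0 or Δ < 0; (2) T > a22, or T ≤ a22 and a22·Δ > 0; (3) T = 0, Δ > 0, a22 ≠ 0; (4) T = 0, Δ = 0, a22 ≠ 0. The function names are hypothetical, and the exact comparisons with zero would need a tolerance when the coefficients come from floating-point computations.

```python
def equivalent_conditions(a11, a12, a21, a22):
    """Return the truth values of equivalent conditions 1-4 as a tuple."""
    T = a11 + a22                    # trace of the Jacobian
    D = a11 * a22 - a12 * a21        # determinant of the Jacobian
    c1 = T > 0 or D < 0
    c2 = T > a22 or (T <= a22 and a22 * D > 0)
    c3 = T == 0 and D > 0 and a22 != 0
    c4 = T == 0 and D == 0 and a22 != 0
    return c1, c2, c3, c4

def is_locally_active(a11, a12, a21, a22):
    """Locally active iff at least one of the four conditions holds."""
    return any(equivalent_conditions(a11, a12, a21, a22))

if __name__ == "__main__":
    # Hypothetical coefficient sets, chosen only to exercise the tests.
    print(is_locally_active(-1.0, 0.0, 0.0, -1.0))   # both self-terms dissipative
    print(is_locally_active(-3.0, 2.0, -2.0, 1.0))   # T < 0, Δ > 0, a22 > 0
```

The second example has T < 0 and Δ > 0, so the linearized cell is asymptotically stable, yet condition 2 holds because a22·Δ > 0.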

To visualize the regions in the $\Delta$-$T$-$a_{22}$ Euclidean space represented by the above system of inequalities, it is more convenient to consider a $\Delta$-$T$-$a_{22}$ cylindrical subspace and partition it into 8 uniform wedges above $\Delta = 0$, and 8 uniform wedges below $\Delta = 0$, as depicted in Fig.9. In terms of the $\Delta$-$T$-$a_{22}$ cylinder, Equivalent condition 1 is represented by all points $(a_{22}, T, \Delta)$ behind the vertical separating plane $T = 0$, and all points below the horizontal separating plane $\Delta = 0$. Equivalent condition 2 is represented by the wedges labeled [1], [2], [3], [4], [5], [7], and [8] in the upper half ($\Delta > 0$) cylinder. Equivalent condition 3 is represented by the vertical separating plane (excluding the vertical axis) in the upper half ($\Delta > 0$) cylinder. Equivalent condition 4 is represented by a single point at the origin. It follows from this partitioned cylinder that a reaction diffusion CNN with one diffusion coefficient and two state variables is locally active at a cell equilibrium point Q if, and only if, its associated parameter $(a_{22}(Q), T(Q), \Delta(Q))$ at Q lies outside of the "blue" sector [6] in the upper half cylinder.

Figure 7. A 2-dimensional reaction-diffusion CNN having "m" non-zero diffusion coefficients ($D_i \ne 0$, i = 1, 2, ..., m). The subscript "i" attached to $(j_i, k_i)$ denotes the node in the ith resistive grid layer which is connected to a terminal of cell C(j, k). All resistors in layer i are linear resistors with identical conductance equal to $D_i$ Siemens, i = 1, 2, ..., m.

4.4  Reaction  Diffusion  CNN:  Two  Diffusion  Coefficients 

Consider next the class of reaction diffusion CNNs with two diffusion coefficients ($D_1 > 0$ and $D_2 > 0$) and two state variables, i.e., m = n = 2. Both the Brusselator and the Meinhardt-Gierer CNN equations in Table 1 belong to this class. In this case, the CNN cell is a 3-terminal (including the ground reference terminal), or 2-port, device whose linearized cell dynamics about an equilibrium point $Q_i$ is described by

$$\dot{v}_1 = a_{11}v_1 + a_{12}v_2 + i_1, \qquad \dot{v}_2 = a_{21}v_1 + a_{22}v_2 + i_2 \tag{38}$$

where $a_{11}$, $a_{12}$, $a_{21}$, and $a_{22}$ are the small-signal cell coefficients at $Q_i$ defined in Eq.(35). Applying the local activity criteria from Section 4.2 to the associated 2 x 2 cell complexity matrix $Y_Q(s)$ at $Q_i$, we obtain the following corollary:

Local  activity  criteria  for  two  diffusion  coefficients 

A two-port reaction-diffusion CNN cell with two diffusion coefficients and two state variables is locally active at a cell equilibrium point $Q = (V_1, V_2, I_1, I_2)$ if, and only if, any one of the following two conditions holds at Q:

1. $a_{22} > 0$

2. $4a_{11}a_{22} < (a_{12} + a_{21})^2$

(39)

For a detailed application of the various local activity criteria presented in Sections 4.3 and 4.4 to concrete examples, namely the FitzHugh-Nagumo Equation, the Brusselator Equation, and the Gierer-Meinhardt Equation, the reader is referred to 34, 35, and 36, respectively.

5  Concluding  Remarks 

The preceding analysis can be generalized to any homogeneous media which can be mapped to a CNN defined by any cell dynamics, and any cell interaction laws, not necessarily of the reaction diffusion type presented in Section 4. In particular, the coupling 7-port $\Gamma_{jkl}$ can be any nonlinear dynamical multi-port. In such cases, in order for the CNN to exhibit complexity, either the m-port cell or the coupling 7-port $\Gamma_{jkl}$ must be locally active at some equilibrium point of the isolated (disconnected from the rest of the CNN) cell $C_{jkl}$, or isolated 7-port coupling $\Gamma_{jkl}$. While the CNN paradigm is an example of REDUCTIONISM par excellence, the true origin of emergence and complexity is traced to a much deeper new concept called local activity. The numerous complex phenomena unified under this mathematically precise principle include self organization, dissipative structures, synergetics, order from disorder, far-from-thermodynamic equilibrium, collective behaviors, edge of chaos, etc.

Figure 9. The $\Delta$-$T$-$a_{22}$ cylinder depicting the locally passive region represented by wedge [6] in the upper half cylinder. All other regions are locally active. In particular, regions [5], [7], and [8] in the upper half cylinder $\Delta > 0$ correspond to the edge of chaos where most complexities emerged.

The central theme of the local activity dogma 22 asserts that the somewhat fuzzy notions of "emergence" and "complexity", as well as their various metamorphoses, such as those cited above, can all be rigorously explained by a precise scientific paradigm abstracted mathematically from the principle of conservation of energy; namely, a CNN operating near the edge of chaos 34,37, where the cells are not only locally active, but also linearly asymptotically stable. In particular, constructive and explicit mathematical inequalities are given for identifying the region in the CNN parameter space where complex phenomena may emerge, as well as for localizing it further into a relatively small parameter domain called the edge of chaos (regions [5], [7], and [8] in Fig.9) where the potential for emergence is maximized.

Consequently: he who wants to have right without wrong,
Order without disorder,
Does not understand the principles
Of heaven and earth.
He does not know how
Things hang together.

Chuang Tzu
Scale invariance (scaling) in a time series of an observable quantity is a symmetry law which, when it exists, can provide unique insights about the process in question. It describes variability and transitions at all scales and is often a result of nonlinear dynamics. It is well known that the spectra of atmospheric and climatic variables possess considerable power at low frequencies. Since "red" spectra are often associated with scaling processes, it is reasonable to suppose that a search for scaling laws in climatic data might be fruitful. Consequently, the search for scaling in such data over the past decade has produced some exciting ways to describe climate variability. There has been a growing interest in the existence of spatially uniform temporal scaling laws for observable properties of the climate system, since such a property would provide a common rule describing temporal variability everywhere on the globe. Here we show that in spatially extended systems, spatially uniform scaling demands that global averages be time invariant. A corollary to this is that where global averages do exhibit temporal variability, as in our climate system, spatial variation in scaling properties is required.


1  Introduction 

A scaling (fractal) process y(t) satisfies the relationship $y(t) =_d \sigma^{-1} y(\lambda t)$, where $=_d$ indicates equality in distribution and $\sigma, \lambda > 0$. This relationship indicates that the statistical properties at time scale t are related to the statistical properties at time scale $\lambda t$. Consequently, any moment of order k, $\mu'_k$, satisfies the relation $\mu'_k(t) = \sigma^{-k}\mu'_k(\lambda t)$. It is easy to show that the power law $\mu'_k(t) = A t^{kH}$, with $H = \log\sigma/\log\lambda$, is a solution to the last equation (Triantafyllou et al.1).

Recently, new approaches based on the theory of random walks have been developed to elucidate scaling in time series (Tsonis and Elsner2, Viswanathan et al.3). According to these approaches a time series x(t) representing some observable (temperature, pressure, etc.) is mapped onto a random walk whose net displacement, y(t), after t time steps is defined by the running sum $y(t) = \sum_{i=1}^{t} x(i)$. For any walk a suitable statistical quantity that characterizes the walk is the root mean square fluctuation F(t) about the average displacement, $F^2(t) = \overline{[\Delta y(t)]^2} - [\overline{\Delta y(t)}]^2$, where $\Delta y(t) = y(t_0 + t) - y(t_0)$ and the bars indicate an average over all positions $t_0$ in the walk. The calculation of F(t) can distinguish three types of behavior: 1) uncorrelated time series described by $F(t) \sim t^H$ with H = 0.5, as expected from the central limit theorem; 2) time series exhibiting positive long-range correlations described by $F(t) \sim t^H$ with H > 0.5; and 3) time series exhibiting negative long-range correlations described by $F(t) \sim t^H$ with H < 0.5. Markov processes with local correlations extending up to some scale also give H = 0.5 for sufficiently large t. It is well known (Feder4) that the correlation function C(t) of future increments, y(t), with past increments, y(-t), is given by $C(t) = 2(2^{2H-1} - 1)$. For H = 0.5, we have C(t) = 0 as expected, but for H ≠ 0.5 we have C(t) ≠ 0 independent of t. This indicates infinitely long correlations and leads to a scale invariance (scaling) associated with positive long-range correlations for H > 0.5 (i.e. an increasing trend in the past implies an increasing trend in the future) and to scaling associated with negative long-range correlations for H < 0.5 (i.e. an increasing trend in the past implies a decreasing trend in the future). Random walks with H ≠ 0.5 are referred to as fractional Brownian motions (fBms). In theory the exponent H is related to the spectra of the y(t) function via a relation of the form $S(f) \propto f^{-(2H+1)}$ and to the spectra of the x(t) function via a relation of the form $S(f) \sim f^{-2H+1}$, where f is the frequency.
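
The random-walk analysis described above can be sketched in a few lines. The code forms the running sum y(t), computes F(t) from the mean and mean square of the displacements Δy(t) = y(t0 + t) - y(t0) over all start positions t0, and estimates H as the slope of log F against log t. The function names, the choice of lags, and the white-noise test series are assumptions made for illustration.

```python
import numpy as np

def fluctuation(x, lags):
    """Root mean square fluctuation F(t) of the walk built from x."""
    y = np.cumsum(x)                      # running sum y(t)
    F = []
    for t in lags:
        dy = y[t:] - y[:-t]               # displacements over all t0
        F.append(np.sqrt(np.mean(dy ** 2) - np.mean(dy) ** 2))
    return np.array(F)

def scaling_exponent(x, lags):
    """Least-squares slope of log F(t) versus log t, an estimate of H."""
    F = fluctuation(x, lags)
    slope, _intercept = np.polyfit(np.log(lags), np.log(F), 1)
    return slope

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.standard_normal(20000)        # uncorrelated test series
    lags = 2 ** np.arange(0, 8)           # t = 1, 2, 4, ..., 128
    print(scaling_exponent(x, lags))      # expected to be close to 0.5
```

For real records the range of lags should stay well below the record length, since the number of independent displacements shrinks as t grows.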

2.  Analysis  and  results 

Let us assume that records of some meteorological variable exist at a sufficiently large number of stations (m) evenly distributed over the globe. Let us further assume that all those stations obey the spatially uniform scaling law $F(t) \sim t^H$ with the same exponent H. If $x_j(t)$, j = 1, ..., m, are the records of the stations, then the global (planetary) mean of those records, $x_g(t)$, is given by:

$$x_g(t) = \frac{1}{m}\sum_{j=1}^{m} x_j(t) \tag{1}$$

It follows that the displacement of the random walk generated by the global mean record is:

$$y_g(t) = \sum_{i=1}^{t} x_g(i) = \frac{1}{m}\sum_{j=1}^{m}\left[\sum_{i=1}^{t} x_j(i)\right] \tag{2}$$

The sum in the brackets is the displacement of the walk for a particular station. Since we have assumed that at each station the records obey the law $F(t) \sim t^H$ with the same H, the outer sum is zero (as it represents the average displacement after t time steps of many random walkers with the same exponent). In this case the above equation reduces to $y_g(t) = 0$, which indicates that the global mean $x_g(t)$ is also zero at any time.

This theoretically derived result can be verified by simple computer simulations. Consider m stations at which some variable y has been observed, and suppose that this quantity scales with the law $F(t) \sim t^H$ with H = 0.7 at all stations. For illustrative purposes, we have generated such a function y(t) for m stations by inverting power spectra of the form $f^{-(2H+1)}$. The formula used to generate the y(t) functions for t = 1, ..., N is given by

$$y(t) = \sum_{k=1}^{N/2}\left[C k^{-\alpha}\left(\frac{2\pi}{N}\right)^{1-\alpha}\right]^{1/2}\cos\left(2\pi t k/N + \phi_k\right)$$

where C is a constant, N is the sample size, $\phi_k$ are N/2 random phases uniformly distributed in $[0, 2\pi]$, and $\alpha = 2H + 1$ (Osborne and Provenzale5; Tsonis6). Then, using $y(t) = \sum_{i=1}^{t} x(i)$, the time series x(t) for each station was produced. From all the available stations we then estimated the global mean $x_g(t)$ for two sample sizes m (Figure 1). For m = 10 the global mean fluctuates significantly above zero, but as m increases the global mean tends at all times to zero. This result demonstrates the theoretical proof provided by equation (2) that a spatially uniform scaling law requires time invariance in the global mean. This behavior is manifestly counter to that of our climate system, which exhibits nonlinearities and variability at all time scales. Indeed, in a recent study it was shown that for the global temperature record the relation between $y_g(t)$ and t involves multiple temporal scaling regimes (Tsonis et al.7).
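
The simulation can be sketched as follows, under stated assumptions: each station's y(t) is synthesized as a sum of N/2 cosines with amplitudes [C k^(-α) (2π/N)^(1-α)]^(1/2), α = 2H + 1, and independent uniform random phases; x(t) is then recovered by differencing y(t). The constant C = 1, the record length, the station counts, the seed, and the function names are illustrative choices, not values from the text.

```python
import numpy as np

def simulate_station_records(m, N, H, rng, C=1.0):
    """Return an (N, m) array whose column j is the record x_j(t), t = 1..N."""
    alpha = 2.0 * H + 1.0
    k = np.arange(1, N // 2 + 1, dtype=float)
    t = np.arange(1, N + 1, dtype=float)
    amp = np.sqrt(C * k ** (-alpha) * (2.0 * np.pi / N) ** (1.0 - alpha))
    phase = rng.uniform(0.0, 2.0 * np.pi, size=(N // 2, m))
    arg = 2.0 * np.pi * np.outer(t, k) / N          # shape (N, N/2)
    # cos(arg + phase) = cos(arg)cos(phase) - sin(arg)sin(phase),
    # which lets all stations be synthesized with two matrix products.
    y = (np.cos(arg) @ (amp[:, None] * np.cos(phase))
         - np.sin(arg) @ (amp[:, None] * np.sin(phase)))
    # Invert the running sum: x(t) = y(t) - y(t-1), taking y(0) = 0.
    return np.diff(y, axis=0, prepend=0.0)

def global_mean(x):
    """Average the station records at each time step."""
    return x.mean(axis=1)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    for m in (10, 1000):
        xg = global_mean(simulate_station_records(m, 512, 0.7, rng))
        print(m, xg.std())
```

Comparing the printed standard deviations for the two values of m shows how the spread of the simulated global mean shrinks as more stations are averaged.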


3.  Discussion 

To those familiar with the theory of random walks this result may not be surprising. Nevertheless, due to limitations in data and other shortcomings, these ideas are often misapplied to problems in the physical sciences and the wrong conclusions are drawn (Koscielny-Bunde et al.8). From the above, it follows that in spatially extended systems displaying variability at all time scales, temporal scaling must vary in space. The spatial distribution of scaling must, in some way, reflect the dynamics of the system. For the climate system, spatial variation in scaling has been clearly demonstrated in a recent study of the 500 hPa height field, which is hydrostatically linked to the mean temperature of the lower troposphere (Tsonis et al.9). In this work, local scaling patterns were linked to specific properties of the atmospheric general circulation (baroclinic instability, storm tracks and persistence of circulation anomalies).


Figure 1. Simulated global mean value from m stations, each one of them obeying the scaling law $F(t) \sim t^H$ with H = 0.7 (see text for details). Horizontal axis: time steps.
We present numerical results for backscattering of electromagnetic waves by fractal aggregates of fractal dimension < 2. A single-scattering analytical result shows that the Fourier transform of the density-density mass correlation function of the scatterer is the main contribution to the wavelength variation of the backscattering cross-section, when the wavelength is in the fractal range of the scattering object. This leads us to conjecture that multispectral backscattering analysis could be used to measure the geometrical fractal dimension of the scatterer. Illustrative results are given using several recently proposed numerical scattering codes accounting for multiple scattering. Applications to lidar experiments in the visible range are briefly discussed.


1  Fractal  Aggregates 

Aerosol particles commonly consist of disordered clusters of nanometer-size grains. To give a few examples, these are soot particles from diesel engines 1, smoke obscurants 2 or haze particles in planetary atmospheres 3. They result from variants of a standard two-step coalescence scenario: in a first regime, high-temperature nucleation creates compact small grains, then low-temperature aggregation forms disordered large clusters in which the previously created grains keep their individuality. Only the local junctions between neighbouring grains can be structurally modified by local melting, forming necks. These small local transformations will be neglected in the following, and all the grains will be considered as identical homogeneous spheres. Everything written below for large fractal aerosol 4 clusters can be translated to disordered fractal colloidal aggregates as well: although the dispersing fluid medium is different, the cluster structures are basically the same 5.

In dense atmosphere, two aggregation models are relevant 6: the irreversible Brownian Cluster-Cluster Aggregation model (BrCCA) and the Reaction-limited Cluster-Cluster Aggregation model (RCCA). The first one (BrCCA) corresponds to the experimental conditions where sticking between grains is so strong that aggregation is irreversible and rapid 7: sticking is permanent once formed. On the other hand, the latter model (RCCA) is the case where sticking is so weak that only the strongest structures are permanently formed 8. In this slow aggregation, the clusters are more compact than for the BrCCA case. Both models lead to fractal structures, and the fractal dimensions are, respectively, $D_f = 1.75$ (for BrCCA) and $D_f \approx 2$ (for RCCA). Fig. 1 gives typical pictures of 3-dimensional clusters numerically built with such models.

Figure 1. Fractal clusters of 512 identical particles generated respectively with the Brownian CCA model ($D_f = 1.75$), and with the Reaction-limited CCA model ($D_f = 2$).

The optical response of such structures carries information about the fractal correlations. The most widely known is the so-called $q^{-D_f}$ law 9, valid for small-angle neutron or X-ray scattering. This gives direct access to the fractal dimension of the clusters 10. This law is derived in the case where scattering is so weak that multiple scattering can be neglected. It should be corrected when multiple scattering is expected to play some role, as when the refractive index is significantly different from 1 and the fractal dimension is larger than 2. Note that the value $D_f = 2$ is the expected threshold; some logarithmic corrections may arise in this case.

Properties of the scattering pattern of an electromagnetic wave by such an aggregate are the direct consequence of interferences between waves with different phases due to the spatial distribution of the grains. This distribution can be characterized by the distance-distribution function representing the density-density correlation function in the case of aggregates of N identical spheres 11:

$$P(\vec{r}) = \frac{1}{N(N-1)}\left\langle \sum_{i \ne j} \delta(\vec{r} - \vec{r}_i + \vec{r}_j) \right\rangle,$$

where the summation runs over all pairs of distinct grains. The $\delta$ is the Dirac distribution, and the average is here over a large set of independent clusters of the same fractal dimension and the same number N of grains. This function $P(\vec{r})$, after orientation averaging, becomes a function of the variable r only. It can be written 12 as:

$$P(r) = \frac{A}{R_g^{D_f}}\, r^{D_f - 3}\, f_c(r/R_g) \tag{1}$$

with a positive cut-off function $f_c$. $R_g$ is the radius of gyration, given by

$$R_g^2 = \frac{1}{2}\int_0^{\infty} r^2 P(r)\, 4\pi r^2\, dr.$$

The coefficient A in Eq.(1) ensures proper normalization of the distance-distribution function. For simple applications, the cut-off function $f_c$ is approximated by an exponentially decreasing function, such that 9:

$$P(r) = \frac{A}{R_g^{D_f}}\, r^{D_f - 3} \exp\left[-\left(\frac{D_f(D_f + 1)}{2}\right)^{1/2}\frac{r}{R_g}\right] \tag{2}$$

This approximation is not valid in general and a much more precise form is $f_c(z) \propto \exp(-z^{D_f}/2)$, as has been shown in 12. The latter is useful for numerical computation, but analytic calculations can often be handled only for the exponential cut-off.
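
As a small consistency check on the exponential cut-off, assume P(r) = A r^(D_f - 3) exp(-β r). Requiring that 4πr²P(r) integrates to one and that half its fourth-moment integral equals R_g² gives β = [D_f(D_f + 1)/2]^(1/2)/R_g and A = β^(D_f)/(4π Γ(D_f)). The sketch below, with hypothetical function names and an arbitrary integration grid, verifies both integrals numerically with a midpoint rule.

```python
import math
import numpy as np

def exponential_cutoff_P(r, Df, Rg):
    """Distance-distribution function with an exponential cut-off.

    beta and A follow from the normalization and radius-of-gyration
    integrals (Gamma-function identities), as stated in the lead-in.
    """
    beta = math.sqrt(Df * (Df + 1.0) / 2.0) / Rg
    A = beta ** Df / (4.0 * math.pi * math.gamma(Df))
    return A * r ** (Df - 3.0) * np.exp(-beta * r)

def check_moments(Df, Rg, r_max_factor=60.0, n=600000):
    """Return (normalization, radius of gyration squared) by midpoint rule."""
    dr = r_max_factor * Rg / n
    r = (np.arange(n) + 0.5) * dr          # midpoints avoid r = 0
    w = 4.0 * np.pi * r ** 2 * exponential_cutoff_P(r, Df, Rg) * dr
    return w.sum(), 0.5 * (r ** 2 * w).sum()

if __name__ == "__main__":
    for Df in (1.75, 2.0):
        print(Df, check_moments(Df, Rg=1.0))
```

The same check can be repeated with another cut-off function by replacing the exponential factor and recomputing the two constants numerically.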


2  Electromagnetic  Scattering  Codes 

The first exact results for scattering of an electromagnetic field by large objects (i.e., of size comparable to the wavelength) were obtained in Mie's classical work 13 for spherical, homogeneous scatterers irradiated by a monochromatic plane wave. The solution is then given as a series of vector spherical functions which (usually) converges. There are several natural ways to extend this result to more complicated cases. One of them is to consider a rigid assembly of such identical balls, and to ask whether one can express the scattered field knowing the exact response of any of these balls. This is the T-matrix formalism, which is briefly reviewed in the following section. Another way is to consider a non-homogeneous ball, where the refractive index depends continuously on the distance to the center, for example according to the averaged density of the cluster. This method is not well known and is discussed in more detail below in Sec. 2.4.

2.1  Orientation-Averaged  T-Matrix  Code 

In principle, for the case of a rigid aggregate of homogeneous spheres, the cluster T-matrix method is the exact solution of the Maxwell equations with appropriate boundary conditions 14. Its basis is just the linearity of the Maxwell equations and boundary conditions. Writing the scattered electric field in terms of the usual expansion:

$$E^s(j) = \sum_{n=1}^{\infty}\sum_{m=-n}^{n} i E_{mn}\left[a^j_{mn} N^{(3)}_{mn}(j) + b^j_{mn} M^{(3)}_{mn}(j)\right], \tag{3}$$

with the standard notations for the spherical harmonic functions 15, the method then consists in expanding all these fields over a unique ensemble of vector spherical functions, by systematic use of the addition theorem, which expresses spherical harmonic functions referring to the center of some grain as series of vector spherical functions with the origin at a reference point. All the fields, including the incident one, being expressed over the same basis of vectors, one can write self-consistently that the local field on particle j is the sum of the incident field plus all fields scattered by other particles, and obtain a set of coupled linear equations in all the expansion coefficients $(a^j_{mn}, b^j_{mn})$. Truncating the order of these equations at a reasonable level of accuracy, the problem reduces to the inversion of a large matrix equation. Moreover, a very efficient computational method has recently been proposed, where orientation averaging is performed analytically 17. Since fractal clusters are essentially non-spherical, except in a statistical sense, this scheme provides considerable help in the practical computation of the average scattering coefficients.


2.2  Mean-Field  Mie-Code 

The T-matrix equations can be substantially simplified in a mean-field approach where all the grains are supposed to scatter the same electromagnetic field. This has been done previously for the dipolar scattered fields 18, and was recently extended to Mie scattering 19, that is, to scattered fields of the form (3). In this case we need not consider self-consistent equations between N scattered fields, but just one self-consistent equation with one (mean-field) scattered field. Inversion of a small matrix equation is needed and usually can be performed quickly on any small computer. It is difficult to judge the correctness of the mean-field hypothesis, but numerically it leads to one of the best approximations now available for self-similar aggregates of identical grains 16. Qualitatively, great inhomogeneities of the local fields should be avoided, and this means that the system is far from any optical resonance 20. Moreover, the geometrical structure itself should be as homogeneous as possible. This means for fractal aggregates that we have to deal only with strictly self-similar objects, like the clusters generated by the models discussed here (BrCCA and RCCA).


2.3  Discrete-Dipole  Approximation 

This method, known as the DDA numerical method 21, consists in dividing the scatterer into identical pieces small enough to be individually considered as electromagnetic dipoles, but large enough that the number of such sub-units is not too large. All these dipoles interact, and since the field radiated by each dipole is analytically simple, the method is exact, provided the space discretisation is fine enough to ensure that the dipolar representation is correct. Below, we used this method with one dipole per grain. This is correct if the wavelength is much larger than the radius of one grain, but as soon as the size parameter becomes of order unity, we should take several dipoles per grain, and the computations quickly become demanding in both CPU time and memory.
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2.4  Fractally- Coated  Code 

This section presents a new approximate computation of the scattering properties of fractal clusters. Instead of taking the exact solution given by the T-matrix method and then averaging over all possible orientations, one can first average the geometric structure, which yields a radially decreasing density of matter, and then compute the scattering properties as for a ball with the same radial density. More precisely, consider fractal distributions of matter with fractal dimension $D_f$, and suppose that these are fractal aggregates of balls of radius $a$. If they are statistically radially symmetric, one can take the center of the ball closest to the center of gravity as the origin of coordinates and write the average density as:

$$
\rho(r) =
\begin{cases}
1 & \text{if } r < a, \\
(r/a)^{D_f - 3} & \text{if } a < r < R, \\
0 & \text{if } r > R,
\end{cases}
\tag{4}
$$

where $R$ is a typical radius of the aggregate. The constraint on the total mass of the aggregate gives explicitly the value of $R$, since we must have

$$
M(\infty) = \int_0^R \left(\frac{r}{a}\right)^{D_f - 3} 4\pi r^2 \, dr
$$

equal to the mass of the aggregate, namely $M = 4\pi a^3 N/3$. This means that

$$
R/a = (D_f N/3)^{1/D_f} .
$$

With these results in hand, one can compute the electromagnetic field scattered by a ball of radial refractive index $n_{av}(r)$ given by the Maxwell-Garnett rule 15

$$
n_{av}^2(r) = \frac{n^2 (1 + 2\rho(r)) + 2(1 - \rho(r))}{n^2 (1 - \rho(r)) + 2 + \rho(r)}
$$

with $n$ the refractive index of the grain. The scattered field is obtained from the multilayered sphere recursive equations 22, which are both stable and accurate.
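
The ingredients of the fractally-coated approach can be sketched in a few lines of Python. This is only an illustration, not the authors' code: the function names, the equal-width shells and the choice of sampling the density at the mid-radius of each shell are assumptions made here, and the multilayered-sphere recursion itself (reference 22) is not reproduced.

```python
import cmath

def aggregate_radius_ratio(n_grains, d_f):
    """R/a from the mass constraint: R/a = (D_f * N / 3) ** (1 / D_f)."""
    return (d_f * n_grains / 3.0) ** (1.0 / d_f)

def average_density(r, a, big_r, d_f):
    """Orientation-averaged density: 1 inside the central grain,
    (r/a)**(D_f - 3) between a and R, 0 outside."""
    if r < a:
        return 1.0
    if r < big_r:
        return (r / a) ** (d_f - 3.0)
    return 0.0

def maxwell_garnett_index(n_grain, rho):
    """Effective complex index for grain material at volume fraction rho in vacuum."""
    n2 = n_grain * n_grain
    eps = (n2 * (1.0 + 2.0 * rho) + 2.0 * (1.0 - rho)) / (n2 * (1.0 - rho) + 2.0 + rho)
    return cmath.sqrt(eps)

def shell_profile(n_grain, a, n_grains, d_f, n_shells):
    """List of (outer radius, effective index) pairs for a layered sphere.
    Equal-width shells and mid-radius sampling are illustrative choices."""
    big_r = a * aggregate_radius_ratio(n_grains, d_f)
    width = (big_r - a) / n_shells
    shells = [(a, complex(n_grain))]  # solid central grain, rho = 1
    for s in range(n_shells):
        r_mid = a + (s + 0.5) * width
        rho = average_density(r_mid, a, big_r, d_f)
        shells.append((a + (s + 1) * width, maxwell_garnett_index(n_grain, rho)))
    return shells

# Example: N = 512 grains, D_f = 2, n = 1.5 + 0.01i, 50 shells.
profile = shell_profile(1.5 + 0.01j, 1.0, 512, 2.0, 50)
```

The resulting list of radii and indices is the kind of input a multilayered-sphere Mie routine would take; the effective index tends to 1 towards the outer radius as the density falls off.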


2.5  Numerical  Comparison  Between  Codes 

All four methods have been implemented in numerical codes for fractal aggregates. We first compare all of them, the T-matrix method being considered as giving the exact results. We present in this article a small part of this task; more systematic comparisons are in preparation 23. We have decided to show here some comparisons for several representative values of the refractive index, namely $n = 1.5$ (non-absorbing), $n = 1.5 + i \cdot 10^{-4}$, $n = 1.5 + i \cdot 10^{-2}$ and $n = 1.5 + i$ (strongly absorbing). Just one size is discussed, $N = 512$, for which the fractal features are known to be well developed, and the grain-size parameter $x = ka = 2\pi a/\lambda$ is between $10^{-4}$ and 1. For these parameters, the T-matrix as well as the DDA methods cannot go to values of $x$ much larger than 1. The extinction and backscattering cross-sections are shown for the four computations in Figs. 2 and 3.

Figure 2. Numerical comparisons between the four methods described in the text, for $N = 512$ RCCA clusters of fractal dimension $D_f = 2$. The refractive index is $n = 1.5 + i \cdot 10^{-2}$, and the grain-size parameter $x$ varies between $10^{-3}$ and 1. Double-logarithmic scale; the cross-sections have all been divided by $N\pi a^2$ for normalisation. Symbols: full circles (and line) for T-matrix, diamonds for Mean-Field Mie, crosses for DDA, squares for the fractally-coated method.

Figure 3. Numerical comparisons between the four methods described in the text, for $N = 512$ RCCA clusters of fractal dimension $D_f = 2$. The refractive index is $n = 1.5 + i\,n''$, with different values of $n''$ between 0 and 1. Note that the data for $n'' = 0$ (i.e. for the non-absorbing index $n = 1.5$) have been shifted to $n'' = 10^{-6}$ to show them on the same figure. Double-logarithmic scale; the cross-sections have all been divided by $N\pi a^2$ for normalisation. Symbols: full circles (and line) for T-matrix, diamonds for Mean-Field Mie, crosses for DDA, squares for the fractally-coated computations.

The results are quantitatively comparable, even if the fractally-coated results are indeed less precise, leading to a slight overestimation of the cross-sections when $x$ becomes large. The DDA code may underestimate the backscattering cross-sections, due to a lack of numerical stability, as shown in the $x = 10^{-1}$ case. These results reinforce the conclusion that the Mean-Field Mie method is the most reliable approximation up to the largest values of $x$ attainable by the T-matrix code with our computers.


3  Backscattering  Coefficient 

The backscattering coefficient needs particular comment. This coefficient is important in some applications like lidar experiments, basically because in such experiments one light beam is emitted and the backscattered light is analysed 24. Because of this simple geometry, this kind of experiment can be performed on distant scatterers, and gives access to precise information about remote aerosol scatterers 25. Furthermore, backscattering is also phenomenologically known to be important for distinguishing between various forms of scatterers 26.


3.1 Single-Scattering Theory


If we suppose single scattering to be the dominant feature, and the cut-off function to be exponential, then the orientation-averaged intensity $I_{sca}$ of the light scattered at angle $\theta$ by a fractal aggregate of size $N$ and fractal dimension $D_f$ is just 9

$$
I_{sca}(N) = N I_{sca}(1) \left[ 1 + (N - 1) \frac{\sin[(D_f - 1) \tan^{-1}(q\xi)]}{(D_f - 1)\, q\xi\, (1 + q^2 \xi^2)^{(D_f - 1)/2}} \right]
\tag{5}
$$

with the modulus of the scattering vector, $q = 2k \sin(\theta/2)$, and $\xi = R_g \sqrt{2/[D_f (D_f + 1)]}$. This formula properly recovers the Guinier regime for $q\xi < 1$ and the fractal $q^{-D_f}$ law for $1/\xi < q < 1/a$. Moreover, for extinction, we are interested in $q = 0$ since we look at the $\theta = 0$ case, and so

$$
\sigma_{ext}(N) = N \sigma_{ext}(1) ,
$$

as a consequence of forward coherence. More precise forms of the extinction cross-section can be found in 18.

The case is totally different for backscattering since in this case $q = 2k$, and the waves scattered by the grains do not have the same phase. More precisely:

$$
\sigma_{back}(N) = N \sigma_{back}(1) \left[ 1 + (N - 1) \frac{\sin[(D_f - 1) \tan^{-1}(2k\xi)]}{(D_f - 1)\, 2k\xi\, (1 + 4k^2 \xi^2)^{(D_f - 1)/2}} \right]
$$

as an immediate consequence of (5). In particular, in the case of clusters large compared to the wavelength ($k\xi \gg 1$), the preceding formula approximates as

$$
\sigma_{back}(N) \simeq N \sigma_{back}(1) \left[ 1 + \frac{B_{D_f}}{(ka)^{D_f}} \right] ,
$$

with a finite numerical coefficient $B_{D_f}$ depending only on the fractal dimension of the scatterer. This means that when $ka$ is small enough, the ratio $\sigma_{back}(N)/N\sigma_{back}(1)$ is essentially proportional to $(ka)^{-D_f}$, or equivalently

$$
\ln\left( \frac{\sigma_{back}(N)}{N \sigma_{back}(1)} \right) \sim D_f \ln(\lambda) \quad \text{for } a < \lambda < R_g .
\tag{6}
$$

When $\lambda < a$, the ratio $\sigma_{back}(N)/N\sigma_{back}(1)$ is close to 1 and its logarithm vanishes. For $\lambda > R_g$, the phase difference between the grains can be neglected and we recover the same coherence peak as for forward scattering 27: $\sigma_{back}(N)/N\sigma_{back}(1) \simeq N$, independently of the wavelength. Relation (6) provides a possible measure of the fractal dimension $D_f$ of scatterers analysed at different wavelengths. This is in principle a natural way to examine a fractal aggregate, since the basic property of such an object is self-similarity. We simply claim that analysing a fractal with different wavelengths corresponds to investigating its structure at different length scales, and the result (6) is an illustration of this fact. A particular case of this idea has previously been proposed 28 with application to fractal Sierpinski gaskets.
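
As a rough numerical illustration of the single-scattering backscattering ratio, the sketch below evaluates $\sigma_{back}(N)/N\sigma_{back}(1)$ from the exponential cut-off formula, read here as $1 + (N-1)\sin[(D_f-1)\tan^{-1}(2k\xi)] / [(D_f-1)\,2k\xi\,(1+4k^2\xi^2)^{(D_f-1)/2}]$. The function name and the parameter values are illustrative assumptions, and the expression as coded requires $D_f \neq 1$.

```python
import math

def backscatter_ratio(n_grains, d_f, k, xi):
    """sigma_back(N) / (N * sigma_back(1)) in the single-scattering
    approximation with exponential cut-off (requires d_f != 1)."""
    u = 2.0 * k * xi
    if u == 0.0:
        return float(n_grains)  # long-wavelength limit: full coherence
    s = math.sin((d_f - 1.0) * math.atan(u))
    denom = (d_f - 1.0) * u * (1.0 + u * u) ** ((d_f - 1.0) / 2.0)
    return 1.0 + (n_grains - 1) * s / denom

# For D_f = 2 the bracket simplifies to 1 + (N - 1) / (1 + 4 k^2 xi^2).
long_wave = backscatter_ratio(512, 2.0, 1e-6, 14.0)    # close to N
intermediate = backscatter_ratio(512, 2.0, 0.5, 1.0)   # 2*k*xi = 1
short_wave = backscatter_ratio(512, 2.0, 1e4, 14.0)    # close to 1
```

Scanning `k` between these two limits traces out the plateau, the power-law decay and the saturation described in the text.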

3.2  Multiple-Scattering  Results 

In principle, relation (6) should give access to the fractal dimension of the clusters if one can measure the backscattering cross-section at different wavelengths between the typical size of the grains and the typical radius of the aggregates. The result is strictly valid only in the single-scattering approximation. One can then wonder whether it also holds when multiple scattering is no longer negligible. In fact, we can argue qualitatively that for fractal dimensions $D_f < 2$, multiple scattering is expected to be small compared to single scattering. So, in this case, the signature of the fractal structure should be seen anyway, even if slightly modified. This should be checked with numerical simulations taking multiple scattering into account.
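
In practice, extracting the fractal dimension from relation (6) amounts to fitting a straight line to the logarithm of the backscattering ratio against the logarithm of the wavelength. A minimal sketch follows, assuming measurements restricted to the range between grain size and aggregate radius; the function name and the synthetic data are hypothetical and serve only to show the fit.

```python
import math

def estimate_fractal_dimension(wavelengths, ratios):
    """Least-squares slope of ln(ratio) against ln(wavelength)."""
    xs = [math.log(w) for w in wavelengths]
    ys = [math.log(r) for r in ratios]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx

# Synthetic data following an exact power law with exponent 1.75.
wavelengths = [2.0, 3.0, 5.0, 8.0]
ratios = [0.3 * w ** 1.75 for w in wavelengths]
d_f_est = estimate_fractal_dimension(wavelengths, ratios)
```

With real data the fitted slope is only meaningful inside the scaling range, so the choice of the wavelength window matters as much as the fit itself.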

But first, we have to be careful about the fact that the wavelength dependence of the refractive index of the grain material is not generally cancelled by such a simple procedure as taking the ratio $\sigma_{back}(N)/N\sigma_{back}(1)$ when multiple scattering is present. Focusing on the signature of the fractal structure, we suppose here that a correction is made in such a way that the refractive index of the grains can be considered as constant over the range of wavelengths. Usually, this can be done when the wavelength dependence of the refractive index of the material and the typical size of the grains are known.

Since we have shown above that the Mean-Field Mie code gives results very close to the exact T-matrix computation inside the range of analysis, we shall discuss the results of this method, which can easily be used for very large values of the grain-size parameter.

Fig. 4 gives the numerical results for $N = 512$ and the two kinds of fractal aggregates, BrCCA with $D_f = 1.75$ and RCCA with $D_f = 2$, for the refractive index $n = 1.5 + i \cdot 10^{-2}$. The single-scattering result (6) is well recovered over about one decade in both cases. This range is the expected one since, for these models and $N = 512$, the values of $\xi/a$ are about 20 and 14 for BrCCA and RCCA respectively. The left-hand plateau for the long wavelengths is the Rayleigh coherent regime, and in this case we expect $\sigma_{back}(N) = N^2 \sigma_{back}(1)$. On the other side, for the small wavelengths, one begins to see oscillating behaviour on the curves, before the possible limiting value $\sim 1/N$. These results show clearly that the single-scattering result (6) is very robust in these cases and dominates the leading behaviour even if multiple scattering is taken into account.

Figure 4. Sketch of the ratio $\sigma_{back}(N)/N^2\sigma_{back}(1)$ vs the grain-size parameter $x$ for BrCCA (diamonds, fractal dimension = 1.75) and RCCA (circles, fractal dimension = 2), calculated with the Mean-Field Mie method. Double-logarithmic plot. The RCCA data have been divided by a factor 10 for visual separation of the two cases. The $D_f$-slope derived in the single-scattering theory is well recovered between the dashed lines.


4  Conclusions 

We briefly reviewed in this work four theoretical methods for computing, either exactly or with good accuracy, the optical cross-sections of fractal aggregates. We used the corresponding numerical codes to compare their results for two sets of typical fractal clusters of fractal dimension $D_f = 1.75$ and $D_f = 2$, relevant in aerosol and colloid physics. For reasonable values of the refractive index of the material, the Mean-Field Mie method is seen to give the most accurate and reliable results, as compared to the exact T-matrix method, which is limited by the large amount of computation required when the grain-size parameter becomes large.

As an application, we have studied the wavelength dependence of the backscattering cross-section for wavelengths between the size of the grains and the radius of the aggregates. It has been shown that such an analysis gives access to a very particular signature of the fractal structure, and could in principle be used to estimate the fractal dimension of the scatterers by lidar experiments, since the lidar signal depends essentially on the backscattering and extinction cross-sections of the scattering aerosols. This should give a new way to measure experimentally the fractal dimension of fractal aggregates by electromagnetic radiation scattering.


Note  :  The  numerical  scattering  codes  for  fractal  aggregates  are  available  upon 
request  from  pra@ccr.jussieu.fr  for  the  Mean-Field  Mie  and  DDA  methods,  and 
from  botet@lps.u-psud.fr  for  the  T-matrix  and  fractally-coated  methods.  The  basic 
T-matrix  code  may  be  found  at  http://www.giss.nasa.gov/  crmivn /,  and  original 
DDA code comes from http://www.astro.princeton.edu/  draine/scattering.html.

Multiple scattering of classical particles on a system of point scatterers distributed in space in a fractal fashion is considered in the small-angle approximation. The asymptotic regime of this process is described by a fractional differential equation generalizing the ordinary diffusion equation in angle space. The solution of the problem is presented both in terms of stable distributions and in terms of Fox functions. The main features of the obtained solutions are discussed.


1  Introduction 


Small-angle x-ray and neutron scattering experiments are efficient tools for studying the structure of condensed matter. Measurements of light or cosmic rays from distant sources play the same role in studying the large-scale structure of the Universe. From a mathematical point of view, these problems belong to the class of inverse problems solved in terms of multiple scattering theory.

The ordinary multiple scattering theory assumes that the random spatial distribution of scatterers is a homogeneous Poisson ensemble, i.e. different scatterers are placed independently of each other. This is what guarantees the exponential character of the free path distribution and leads to the Boltzmann master equation. In the region of large depths and small angles the equation takes the form of a diffusion equation in angle space, called the Fokker-Planck equation. Its solution is the two-dimensional (in angle space) Gauss distribution, also called the Fermi distribution. However, the above-mentioned assumption becomes invalid for fractal media characterized by long-range correlations of power-law type.

One approach to the solution of the problem was developed by S. Maleyev 1. He considered a medium with fractal pores distributed uniformly in space. As a result, superdiffusion behaviour of the angular distribution was found. Another approach was developed in our previous work 2, where the fractal medium is considered as a set of identical point scatterers distributed in space in a fractal manner by means of the Levy flight procedure (for details see 3,4,5). This model reveals subdiffusion behaviour of the angular distribution.

The present work combines both of these ideas. We consider here a set of fractal clusters distributed in space in a fractal manner. As a result, we obtain a two-parameter family of angular distributions. The family includes the Fermi distribution and the subdiffusion and superdiffusion distributions as particular cases. They obey fractional differential equations and are presented both in terms of stable distributions and in terms of Fox functions.


2 Basic postulates of the small-angle scattering theory

Let a particle move from the origin of coordinates along the $x$-axis. After the scattering, the particle will be characterized by the vector angle $\Theta$ of deviation from the initial direction $\Omega_0$ of the motion

$$
\Theta = \Omega - \Omega_0 .
$$

The following postulates form the well-known basis of the small-angle approximation in the scattering theory.

i) The particle is on the $x$-axis during all its motion even though the (small) vector $\Theta$ differs from zero.

ii) The vector $\Theta$ stays constant between the scattering points $X_1, X_1 + X_2, X_1 + X_2 + X_3, \ldots$ and undergoes random jumps $\Theta_1, \Theta_2, \Theta_3, \ldots$ at these points, so

$$
\Theta(x) = \sum_{i=1}^{N(x)} \Theta_i ,
$$

where $N(x)$ is the random number of scatterings on the segment $[0, x)$.

iii) The random variables $X_1, X_2, X_3, \ldots$ are independent of each other and have the same distribution density $q(x)$.

iv) The random variables $\Theta_1, \Theta_2, \Theta_3, \ldots$ are independent of each other and of $X_1, X_2, X_3, \ldots$ and identically distributed with the density $\sigma(\theta)$.

v) The distribution $\sigma(\theta)$ is concentrated in such a small region near zero that the vectors $\Theta_i$ and $\Theta(x)$ may be considered as two-dimensional vectors and integration with respect to $\theta$ may be extended to the whole plane $R^2$.

On these assumptions, the angular distribution density $p(\theta, x)$ of a particle that has travelled the path $x$ is given by the sum

$$
p(\theta, x) = \delta(\theta) Q(x) + \sigma(\theta) \int_0^x Q(x - x') q(x')\, dx'
+ \sigma * \sigma(\theta) \int_0^x Q(x - x')\, q * q(x')\, dx'
+ \sigma * \sigma * \sigma(\theta) \int_0^x Q(x - x')\, q * q * q(x')\, dx' + \ldots
\tag{1}
$$

Here $*$ means convolution of distributions,

$$
q * q(x) = \int_0^x q(x - x') q(x')\, dx' , \qquad
\sigma * \sigma(\theta) = \int_{R^2} \sigma(\theta - \theta') \sigma(\theta')\, d\theta' ,
$$

and

$$
Q(x) = \int_x^\infty q(x')\, dx' .
$$

Represent (1) in the form

$$
p(\theta, x) = \int_0^x Q(x - x') f(\theta, x')\, dx' .
\tag{2}
$$

The function $f(\theta, x)$ is the density of collisions and can be expanded in the collision number series

$$
f(\theta, x) = \delta(\theta)\delta(x) + \sigma(\theta) q(x) + \sigma * \sigma(\theta)\, q * q(x) + \sigma * \sigma * \sigma(\theta)\, q * q * q(x) + \ldots
= \sum_{n=0}^{\infty} \sigma^{(n)}(\theta)\, q^{(n)}(x)
\tag{3}
$$

where

$$
\sigma^{(0)}(\theta) = \delta(\theta), \quad \sigma^{(1)}(\theta) = \sigma(\theta), \quad \sigma^{(n)}(\theta) = \underbrace{\sigma * \ldots * \sigma}_{n}(\theta), \quad n > 1,
$$

and similarly for $q^{(n)}(x)$. It is easy to see that the distribution (3) is a solution of the master equation

$$
f(\theta, x) = \delta(\theta)\delta(x) + \int_0^x dx'\, q(x - x') \int_{R^2} d\theta'\, \sigma(\theta - \theta') f(\theta', x') .
\tag{4}
$$

Formula (2) together with master equation (4) gives a complete description of the problem.


3  The  scattering  theory  for  a  homogeneous  medium 

The ordinary theory of multiple scattering results from adding two further postulates to those mentioned above:

vi) The mean square angle of a single scattering has a finite value

$$
\langle \theta^2 \rangle = \int_{R^2} \sigma(\theta)\, \theta^2\, d\theta = 2\pi \int_0^\infty \sigma(\theta)\, \theta^3\, d\theta < \infty .
$$

vii) The random variables $X_1, X_1 + X_2, X_1 + X_2 + X_3, \ldots$ form a Poisson ensemble, i.e. the increments $X_1, X_2, X_3, \ldots$ are distributed according to the exponential law

$$
q(x) = \mu e^{-\mu x}, \quad \mu > 0 .
\tag{5}
$$

Inserting (5) into (2) and (4) we obtain two expressions

$$
p(\theta, x) = e^{-\mu x} \int_0^x e^{\mu x'} f(\theta, x')\, dx' ,
$$

$$
f(\theta, x) = \delta(\theta)\delta(x) + \mu e^{-\mu x} \int_{R^2} d\theta'\, \sigma(\theta - \theta') \int_0^x e^{\mu x'} f(\theta', x')\, dx' ,
$$

which after simple transformations take the form of the integro-differential kinetic equation (the Boltzmann equation)

$$
\frac{\partial p(\theta, x)}{\partial x} = -\mu\, p(\theta, x) + \mu \int_{R^2} d\theta'\, \sigma(\theta - \theta')\, p(\theta', x)
$$

with the initial condition

$$
p(\theta, 0) = \delta(\theta) .
$$

In the large-$x$ asymptotics, the equation reduces to the ordinary diffusion equation

$$
\frac{\partial p(\theta, x)}{\partial x} = D \Delta_\theta\, p(\theta, x)
\tag{6}
$$

with $D = \mu \langle \theta^2 \rangle / 4$. It is supposed here that $\sigma(\theta)$ is an axially symmetric function depending on $|\theta| = \theta$.
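
The homogeneous-medium postulates translate directly into a Monte Carlo procedure, which may help in checking the diffusion limit numerically. The sketch below assumes exponential free paths with rate `mu` and, purely for illustration, Gaussian single-scattering angles with standard deviation `sigma_1d` per component (so that the single-scattering mean square angle is `2 * sigma_1d**2`). It then compares the sampled mean square angle with $4Dx$, taking $D = \mu\langle\theta^2\rangle/4$; all names and parameter values are hypothetical.

```python
import random

def simulate_deflection(x, mu, sigma_1d, rng):
    """One realisation of Theta(x): the sum of N(x) independent 2D Gaussian jumps."""
    tx = ty = 0.0
    pos = rng.expovariate(mu)          # position of the first scattering
    while pos < x:
        tx += rng.gauss(0.0, sigma_1d)
        ty += rng.gauss(0.0, sigma_1d)
        pos += rng.expovariate(mu)     # exponential free path to the next one
    return tx, ty

def mean_square_angle(x, mu, sigma_1d, n_particles=10000, seed=1):
    """Sample average of |Theta(x)|^2 over independent particles."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_particles):
        tx, ty = simulate_deflection(x, mu, sigma_1d, rng)
        total += tx * tx + ty * ty
    return total / n_particles

mu, sigma_1d, depth = 2.0, 0.01, 5.0
single = 2.0 * sigma_1d ** 2           # mean square angle of one scattering
diffusivity = mu * single / 4.0
estimate = mean_square_angle(depth, mu, sigma_1d)
expected = 4.0 * diffusivity * depth   # equals mu * depth * single
```

The two numbers agree up to sampling noise, which for ten thousand particles is well below a few percent.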


4  The  scattering  theory  for  a  fractal  medium 

The passage from a homogeneous medium to a fractal one is performed by replacing postulates vi) and vii) with the following:

vi)* The angular distribution of particles scattered on a single fractal cluster is characterized by the power law 1

$$
\int_{|\theta'| > \theta} \sigma(\theta')\, d\theta' \sim A \theta^{-\alpha}, \quad \theta \to \infty, \quad 0 < \alpha < 2 .
\tag{7}
$$

vii)* The free path distribution in a fractal medium has a long power-law tail too 2:

$$
Q(x) \sim B x^{-\beta}, \quad x \to \infty, \quad 0 < \beta < 1 .
\tag{8}
$$

Note that the mean square angle in the first case and the mean free path in the second one are infinite.
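
Heavy-tailed angles and free paths of the kind described by (7) and (8) can be sampled by inverting a pure power-law survival function. The following sketch uses Pareto laws as one convenient (assumed) realisation: $P(|\Theta_i| > t) = (\theta_{min}/t)^{\alpha}$ and $Q(x) = (x_{min}/x)^{\beta}$, so that $A = \theta_{min}^{\alpha}$ and $B = x_{min}^{\beta}$. The function names and cut-offs are hypothetical.

```python
import math
import random

def sample_power_tail(u, scale, exponent):
    """Inverse-CDF sample with P(X > t) = (scale / t) ** exponent for t >= scale.
    The argument u must lie in (0, 1]."""
    return scale * u ** (-1.0 / exponent)

def simulate_fractal_deflection(x, alpha, beta, theta_min, x_min, rng):
    """One realisation of Theta(x) with power-law angles and free paths."""
    tx = ty = 0.0
    # rng.random() lies in [0, 1), so 1 - rng.random() lies in (0, 1].
    pos = sample_power_tail(1.0 - rng.random(), x_min, beta)
    while pos < x:
        size = sample_power_tail(1.0 - rng.random(), theta_min, alpha)
        phi = 2.0 * math.pi * rng.random()     # isotropic azimuth
        tx += size * math.cos(phi)
        ty += size * math.sin(phi)
        pos += sample_power_tail(1.0 - rng.random(), x_min, beta)
    return tx, ty

rng = random.Random(0)
tx, ty = simulate_fractal_deflection(100.0, 1.5, 0.7, 1e-3, 1.0, rng)
```

Because every free path is at least `x_min`, the loop always terminates even though the mean free path is infinite for $\beta < 1$.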

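The information given by (7) and (8) is enough to find the asymptotic behaviour of the distribution $p(\theta, x)$ at large distances $x$. To do this we apply the Fourier-Laplace transform

$$
p(k, \lambda) = \int_{R^2} d\theta \int_0^\infty dx\, e^{ik\theta - \lambda x}\, p(\theta, x) ,
$$

where $k$ is a two-dimensional vector and $k\theta$ is the scalar product. Formulas (2) and (4) take the following form:

$$
p(k, \lambda) = [1 - q(\lambda)]\, f(k, \lambda)/\lambda ,
$$

$$
f(k, \lambda) = 1 + q(\lambda)\, \sigma(k)\, f(k, \lambda) .
$$

Combining them we get

$$
p(k, \lambda) = \frac{1 - q(\lambda)}{\lambda\, [1 - q(\lambda)\sigma(k)]} .
$$

This is the Montroll-Weiss result for a random walk with trapping in a two-dimensional space 6 (see also 7 and 8).

According to the Tauberian theorem 9, the relations (7) and (8) determine the behaviour of the transforms $\sigma(k)$ and $q(\lambda)$ at small arguments:

$$
1 - \sigma(k) \sim a k^{\alpha}, \quad k \to 0, \quad a = A\, \frac{2^{-\alpha}\, \Gamma(1 - \alpha/2)}{\Gamma(1 + \alpha/2)} ,
$$

$$
1 - q(\lambda) \sim b \lambda^{\beta}, \quad \lambda \to 0, \quad b = \Gamma(1 - \beta) B .
$$

In turn, these expressions make it possible to find the asymptotic form of $p(k, \lambda)$:

$$
p(k, \lambda) \sim p_{as}(k, \lambda) = \frac{\lambda^{\beta - 1}}{\lambda^{\beta} + C k^{\alpha}}, \quad \lambda \to 0, \quad k \to 0
\tag{9}
$$

with $C = a/b$, whence

$$
p_{as}(\theta, x) = \frac{1}{(2\pi)^2\, 2\pi i} \int_L d\lambda \int_{R^2} dk\, \frac{\lambda^{\beta - 1}}{\lambda^{\beta} + C k^{\alpha}}\, e^{-ik\theta + \lambda x}
= (C x^{\beta})^{-2/\alpha}\, \Psi^{(\alpha, \beta)}\!\left( \theta\, (C x^{\beta})^{-1/\alpha} \right)
\tag{10}
$$

where

$$
\Psi^{(\alpha, \beta)}(\theta) = \frac{1}{(2\pi)^2\, 2\pi i} \int_L d\lambda \int_{R^2} dk\, \frac{\lambda^{\beta - 1}}{\lambda^{\beta} + k^{\alpha}}\, e^{-ik\theta + \lambda} .
\tag{11}
$$

On the other hand, Eq. (9) rewritten in the form

$$
\lambda^{\beta} p_{as}(k, \lambda) = -C k^{\alpha} p_{as}(k, \lambda) + \lambda^{\beta - 1}
$$

can be considered as the Fourier-Laplace transform of a fractional differential equation. In the terms used in the book 10 it has the form

$$
D^{\beta}_{0+}\, p_{as}(\theta, x) = -C (-\Delta_\theta)^{\alpha/2}\, p_{as}(\theta, x) + \frac{x^{-\beta}}{\Gamma(1 - \beta)}\, \delta(\theta)
\tag{12}
$$

with the initial condition

$$
p_{as}(\theta, 0) = \delta(\theta) .
$$

When $\alpha \to 2$ and $\beta \to 1$ the equation reduces to the ordinary diffusion equation (6) with $D = C$.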
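
The step from the Montroll-Weiss expression to the asymptotic form (9) can be checked numerically. The sketch below assumes model transforms that are exactly $q(\lambda) = 1 - b\lambda^{\beta}$ and $\sigma(k) = 1 - a k^{\alpha}$ (an idealisation valid only at small arguments), inserts them into $(1 - q)/(\lambda[1 - q\sigma])$, and compares the result with $\lambda^{\beta-1}/(\lambda^{\beta} + C k^{\alpha})$, $C = a/b$. Names and numbers are illustrative.

```python
def montroll_weiss(q_hat, sigma_hat, lam):
    """p(k, lambda) = (1 - q) / (lambda * (1 - q * sigma))."""
    return (1.0 - q_hat) / (lam * (1.0 - q_hat * sigma_hat))

def asymptotic_transform(k, lam, alpha, beta, c):
    """p_as(k, lambda) = lambda**(beta - 1) / (lambda**beta + c * k**alpha)."""
    return lam ** (beta - 1.0) / (lam ** beta + c * k ** alpha)

alpha, beta, a, b = 1.5, 0.7, 0.8, 1.3
k, lam = 1e-3, 1e-5
q_hat = 1.0 - b * lam ** beta          # small-lambda model of the free-path transform
sigma_hat = 1.0 - a * k ** alpha       # small-k model of the angular transform
exact = montroll_weiss(q_hat, sigma_hat, lam)
approx = asymptotic_transform(k, lam, alpha, beta, a / b)
relative_gap = abs(exact / approx - 1.0)
```

The relative gap is of the order of the neglected cross term $ab\lambda^{\beta}k^{\alpha}$ divided by $b\lambda^{\beta} + ak^{\alpha}$, and shrinks as both arguments go to zero.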
5 Results

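The integral (11), giving the solution of equation (12) via (10), may be represented in two forms 11. The first of them is

$$
\Psi^{(\alpha, \beta)}(\theta) = \int_0^\infty g_2^{(\alpha)}\!\left( \theta x^{\beta/\alpha} \right) g_1^{(\beta)}(x)\, x^{2\beta/\alpha}\, dx
\tag{13}
$$

where $g_1^{(\beta)}(x)$ and $g_2^{(\alpha)}(\theta)$ are the one-dimensional one-sided stable density and the two-dimensional axially symmetric stable density, determined by their Laplace and Fourier transforms respectively:

$$
\int_0^\infty e^{-\lambda x}\, g_1^{(\beta)}(x)\, dx = e^{-\lambda^{\beta}} ,
$$

$$
\int_{R^2} e^{ik\theta}\, g_2^{(\alpha)}(\theta)\, d\theta = e^{-k^{\alpha}} .
$$

Stable distributions are described in detail in 11,12.

The second form uses the $H$-functions, generalized hypergeometric functions also called the Fox functions 13,

$$
H^{mn}_{pq}\!\left( z \,\middle|\, \begin{matrix} (a_1, \alpha_1) \ldots (a_p, \alpha_p) \\ (b_1, \beta_1) \ldots (b_q, \beta_q) \end{matrix} \right)
= \sum_{j=1}^{m} \sum_{k=0}^{\infty} c_{jk}\, \frac{(-1)^k}{k!\, \beta_j}\, z^{s_{jk}} ,
$$

where $m$, $n$, $p$ and $q$ are integers such that $0 \le n \le p$ and $1 \le m \le q$; $\alpha_j$ and $\beta_j$ are positive,

$$
s_{jk} = (b_j + k)/\beta_j ,
$$

$$
c_{jk} = \frac{\prod_{l=1,\, l \ne j}^{m} \Gamma(b_l - \beta_l s_{jk})\, \prod_{l=1}^{n} \Gamma(1 - a_l + \alpha_l s_{jk})}{\prod_{l=m+1}^{q} \Gamma(1 - b_l + \beta_l s_{jk})\, \prod_{l=n+1}^{p} \Gamma(a_l - \alpha_l s_{jk})} .
$$

In terms of $H$-functions, the angular function $\Psi^{(\alpha, \beta)}(\theta)$ looks as follows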

-JL( 2' \2+“  #12  ( (2Y  (-1,  l/a)(-o/2,  <8/2)(l  -  a/2, 1/2)  \ 

2  32  (W  (0,  l/a)(— 1, 1/a)  )' 

,!,(<»./?) /m  1  #21  (9  (1, 1/a)  (1, /?/a)  \  , 

2  ()  ir(a0)2**23  (2  (1,  l/a)(l,  1/2)(1, 1/2)  J  ’  “  -2 

When $\alpha = 2$ and $\beta = 1$,

$$
g_1^{(1)}(x) = \delta(x - 1) ,
$$

$$
g_2^{(2)}(\theta) = \frac{1}{4\pi} \exp\{-\theta^2/4\}
\tag{14}
$$

and the density (13) reduces to the ordinary result for a homogeneous medium

$$
p_{as}(\theta, x) = \frac{1}{4\pi C x} \exp\{-\theta^2/(4Cx)\}
$$

with the diffusivity $C$. This is the two-dimensional Gauss distribution with the mean square angle

$$
\langle \theta^2 \rangle = \int_{R^2} p_{as}(\theta, x)\, \theta^2\, d\theta = 4Cx ,
\tag{15}
$$

describing normal diffusion.
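
The normalisation and the mean square angle of the Gaussian limit are easy to confirm by direct quadrature over the plane. A small sketch, with an assumed midpoint rule in the radial variable and arbitrary example values of $C$ and $x$:

```python
import math

def gaussian_density(theta, c, x):
    """Two-dimensional Gauss density exp(-theta^2 / (4 C x)) / (4 pi C x)."""
    return math.exp(-theta * theta / (4.0 * c * x)) / (4.0 * math.pi * c * x)

def radial_moment(density, order, theta_max, steps=20000):
    """Midpoint rule for the integral of theta**order * density over the plane,
    using the area element 2 * pi * theta * d(theta)."""
    h = theta_max / steps
    total = 0.0
    for i in range(steps):
        t = (i + 0.5) * h
        total += t ** order * density(t) * 2.0 * math.pi * t * h
    return total

c, x = 0.5, 2.0
norm = radial_moment(lambda t: gaussian_density(t, c, x), 0, 40.0)
msq = radial_moment(lambda t: gaussian_density(t, c, x), 2, 40.0)
# norm should be close to 1 and msq close to 4 * c * x = 4.
```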

When $\alpha < 2$ and $\beta = 1$ we obtain

$$
p_{as}(\theta, x) = (Cx)^{-2/\alpha}\, g_2^{(\alpha)}\!\left( \theta\, (Cx)^{-1/\alpha} \right) .
$$

The mean square angle is now infinite (due to the property of the density $g_2^{(\alpha)}$), but we may take

$$
\delta\theta = (Cx)^{1/\alpha}
$$

as the measure of the angular distribution width. This formula reveals a more rapid broadening of the angular distribution than in the normal case (15). We see here the superdiffusion regime considered in 1 (a more detailed consideration for $d$-dimensional space is performed in 14).

In the opposite case, $\alpha = 2$ and $\beta < 1$, the root mean square of the scattering angle $\theta$ exists and is

$$
\langle \theta^2 \rangle^{1/2} = 2 \left( \frac{C}{\Gamma(1 + \beta)} \right)^{1/2} x^{\beta/2}, \quad \beta < 1 .
$$

This is subdiffusion 15.

In the general case the rate of broadening of the angular distribution is determined by the exponent $\beta/\alpha$:

$$
\delta\theta = C^{1/\alpha} x^{\beta/\alpha} .
$$

Thus multiple scattering shows subdiffusion asymptotic behaviour if $\beta/\alpha < 1/2$ and superdiffusion behaviour if $\beta/\alpha > 1/2$. Both regimes are covered by the term "anomalous diffusion" (see 8,16 and references therein).
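
The classification by the exponent $\beta/\alpha$ can be written down as a small helper. The function names and the label used for the borderline case $\beta/\alpha = 1/2$ are choices made here for illustration; note that the borderline label refers only to the rate of broadening, not to the shape of the density.

```python
def broadening_exponent(alpha, beta):
    """Exponent nu in the width law delta_theta ~ x**nu, nu = beta / alpha."""
    return beta / alpha

def classify_regime(alpha, beta, tol=1e-12):
    """Label the asymptotic regime from the exponent beta / alpha."""
    if not (0.0 < alpha <= 2.0 and 0.0 < beta <= 1.0):
        raise ValueError("expected 0 < alpha <= 2 and 0 < beta <= 1")
    nu = broadening_exponent(alpha, beta)
    if abs(nu - 0.5) <= tol:
        return "normal-rate"
    return "superdiffusion" if nu > 0.5 else "subdiffusion"

def angular_width(x, c, alpha, beta):
    """Width measure delta_theta = C**(1/alpha) * x**(beta/alpha)."""
    return c ** (1.0 / alpha) * x ** (beta / alpha)

regimes = {
    "homogeneous": classify_regime(2.0, 1.0),       # normal-rate
    "fractal clusters": classify_regime(1.5, 1.0),  # superdiffusion
    "fractal paths": classify_regime(2.0, 0.6),     # subdiffusion
}
```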

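However, not only the rate of broadening of the angular distribution changes in passing from a homogeneous medium to a fractal one: the very shape of the density changes too. Whereas $\Psi^{(2,1)}(\theta)$ is simply the two-dimensional Gaussian distribution, the densities $\Psi^{(\alpha,1)}(\theta)$, $\alpha < 2$, are two-dimensional symmetric stable densities with characteristic exponent $\alpha$. They have a dome-shaped form near zero with the maximum value

$$
\Psi^{(\alpha,1)}(0) = \frac{\Gamma(1 + 2/\alpha)}{4\pi} ,
$$

long tails of power type

$$
\Psi^{(\alpha,1)}(\theta) \sim \left[ \frac{2^{\alpha/2}\, \Gamma(1 + \alpha/2)}{\pi} \right]^2 \sin(\alpha\pi/2)\, \theta^{-\alpha - 2}, \quad \theta \to \infty ,
$$

and no moments of order $\ge \alpha$ (see 10). On the contrary, the densities $\Psi^{(2,\beta)}(\theta)$, $\beta < 1$, diverge logarithmically at zero,

$$
\Psi^{(2,\beta)}(\theta) \sim \frac{|\ln\theta|}{2\pi\, \Gamma(1 - \beta)}, \quad \theta \to 0 ,
$$

and have rapidly falling tails

$$
\Psi^{(2,\beta)}(\theta) \sim \frac{\beta^{(3\beta/2 - 1)/(2 - \beta)}}{4\pi \sqrt{2 - \beta}}\, (\theta/2)^{-2(1 - \beta)/(2 - \beta)} \exp\left\{ -(2 - \beta)\, \beta^{\beta/(2 - \beta)}\, (\theta/2)^{2/(2 - \beta)} \right\}, \quad \theta \to \infty
\tag{16}
$$

and finite moments of all positive orders

$$
\int_{R^2} \theta^{2n}\, \Psi^{(2,\beta)}(\theta)\, d\theta = 4^n\, \frac{[\Gamma(n + 1)]^2}{\Gamma(n\beta + 1)} .
$$

Note that (16) reduces to (14) when $\beta \to 1$.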
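
The reduction of the tail formula (16) to the Gaussian (14) can be verified numerically. The sketch below is hedged in one important respect: the printed form of (16) is hard to read in this copy, so the code assumes the tail is $\beta^{(3\beta/2-1)/(2-\beta)} / (4\pi\sqrt{2-\beta}) \cdot (\theta/2)^{-2(1-\beta)/(2-\beta)} \exp\{-(2-\beta)\beta^{\beta/(2-\beta)}(\theta/2)^{2/(2-\beta)}\}$ and that the even moments are $4^n [\Gamma(n+1)]^2/\Gamma(n\beta+1)$, which are the forms consistent with the $\beta = 1$ limit. Function names are illustrative.

```python
import math

def gaussian_psi(theta):
    """Gaussian limit (alpha = 2, beta = 1): exp(-theta^2 / 4) / (4 pi)."""
    return math.exp(-theta * theta / 4.0) / (4.0 * math.pi)

def subdiffusive_tail(theta, beta):
    """Assumed large-theta form of the alpha = 2, beta <= 1 density."""
    half = theta / 2.0
    prefactor = beta ** ((1.5 * beta - 1.0) / (2.0 - beta))
    prefactor /= 4.0 * math.pi * math.sqrt(2.0 - beta)
    power = half ** (-2.0 * (1.0 - beta) / (2.0 - beta))
    rate = (2.0 - beta) * beta ** (beta / (2.0 - beta))
    return prefactor * power * math.exp(-rate * half ** (2.0 / (2.0 - beta)))

def even_moment(n, beta):
    """Assumed moment of order 2n: 4**n * Gamma(n + 1)**2 / Gamma(n * beta + 1)."""
    return 4.0 ** n * math.gamma(n + 1) ** 2 / math.gamma(n * beta + 1.0)

gap_at_beta_one = abs(subdiffusive_tail(3.0, 1.0) - gaussian_psi(3.0))
second_moment_gauss = even_moment(1, 1.0)   # 4, the Gaussian value for C x = 1
second_moment_sub = even_moment(1, 0.5)     # larger prefactor for beta < 1
```

For $\beta < 1$ the tail decays as a stretched exponential with exponent $2/(2-\beta) < 2$, i.e. more slowly than the Gaussian but still faster than any power.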

In the case $\alpha = 1$, $\beta = 1/2$, the density $\Psi^{(1,1/2)}(\theta)$ is expressed in terms of the incomplete gamma function:

$$
\Psi^{(1,1/2)}(\theta) = (4\pi)^{-3/2}\, e^{\theta^2/4}\, \Gamma(-1/2,\ \theta^2/4) .
$$


Acknowledgments

The work is partially supported by the Russian Foundation for Basic Research (grant # 98-01-03307). I am grateful to G. Gusarov for preparing this manuscript for printing.


References 

1. S.V. Maleyev, Phys. Rev. B, 52, 13163 (1995).

2. V.V. Uchaikin, D.A. Korobko, Techn. Physics Letters, 25, 435 (1999).

3. V.V. Uchaikin, G.G. Gusarov, J. Math. Phys., 38, 2453 (1997).

4. V. Uchaikin, I. Gismjatov, G. Gusarov, V. Svetukhin, Intern. J. Bif. and Chaos, 8, 977 (1998).

5. V.V. Uchaikin, G.G. Gusarov, D.A. Korobko, Journ. Math. Sci., 92, 3940 (1998).

6. E.W. Montroll, G.H. Weiss, J. Math. Phys., 6, 167 (1965).

7. V.V. Afanasiev, R.Z. Sagdeev, G.M. Zaslavsky, Chaos 1(2), 143 (1991).

8. V.V. Uchaikin, Physica A 255, 65 (1998).

9. W. Feller, Introduction to Probability Theory and Its Applications (Vol. II, J. Wiley, New York, 1971).

10. S.G. Samko, A.A. Kilbas, O.I. Marichev, Fractional Integrals and Derivatives - Theory and Applications (Gordon and Breach, New York, 1993).

11. V.V. Uchaikin, V.M. Zolotarev, Chance and Stability. Stable Distributions and their Applications (VSP, Utrecht, The Netherlands, 1999).

12. V.M. Zolotarev, One-dimensional Stable Distributions (Amer. Math. Soc., Providence, R.I., 1986).

13. A.M. Mathai, R.K. Saxena, The H-function with Applications in Statistics and Other Disciplines (Wiley, New Delhi, 1978).

14. V.M. Zolotarev, V.V. Uchaikin, V.V. Saenko, Journ. of Exper. and Theor. Physics, 88, 780 (1999).

15. V.V. Uchaikin, Journ. of Exper. and Theor. Physics, 88, 1155 (1999).

16. J.-P. Bouchaud and A. Georges, Phys. Rep., 195, 127 (1990).


SYMMETRIC FRACTALS GENERATED BY CELLULAR AUTOMATA


A. BARBE AND F. VON HAESELER
Department of Electrical Engineering, KU Leuven,
Kardinaal Mercierlaan 94, 3001 Heverlee, Belgium
E-mail: {Andre.Barbe,fvanhaes}@esat.kuleuven.ac.be

We demonstrate how to construct fractals which are generated by a combination of a cellular automaton and a substitution. Moreover, if the substitution and the cellular automaton exhibit certain symmetry features, the fractal will inherit these symmetries.

1 Introduction

It is a well-known fact that certain cellular automata can generate fractals 7,8,9,14,15. These fractals can be described in terms of hierarchical iterated function systems 12, graph-directed constructions 11 or as a mixed self-similar set 2.

In this note we consider cellular automata which have certain symmetries. After the introduction of some basic concepts and formalism about substitutions, fractals and cellular automata in Sections 2 and 3, we shall show in Section 4 how to construct fractals generated by a cellular automaton such that the fractal inherits the symmetries of the cellular automaton.

2  Substitutions  and  fractals 

In  this  section  we  introduce  the  concept  of  a  substitution  which  is  needed  for  our  ap¬ 
plications.  We  discuss  how  substitutions  generate  a  compact  subset  which  exhibits 
a  hierarchical  self-similar  structure,  and  thus  is  usually  a  fractal  set. 

We will consider sequences $(f_j)_{j \in \mathbb{Z}}$, where $f_j$ belongs to a finite commutative
ring $\mathcal{R}$ with 0 and 1. It will be useful to represent this sequence as a formal Laurent
series with coefficients in $\mathcal{R}$, as $f$ or as

\[ f = f(X) = \sum_{j \in \mathbb{Z}} f_j X^j . \]

The set of all such Laurent series will be denoted by $\mathcal{R}(X)$. The support $\mathrm{supp}(f)$
of $f \in \mathcal{R}(X)$ is the set $\{ j \mid f_j \neq 0 \}$. $f$ is called a Laurent polynomial if $\mathrm{supp}(f)$
is finite. The set of all Laurent polynomials is denoted by $\mathcal{R}_c(X)$. If $\mathrm{supp}(f) \subset \mathbb{N}$
and is finite, then $f$ is called a polynomial. The set of polynomials with coefficients
in $\mathcal{R}$ is denoted by $\mathcal{R}[X]$.

The set of all maps $a : \mathcal{R} \to \mathcal{R}$ equipped with the addition and composition is
denoted by $\mathrm{Abb}(\mathcal{R})$. With 0 we denote the map $r \mapsto 0$ and with 1 we denote the
identity map $r \mapsto r$. The subset of all maps $a : \mathcal{R} \to \mathcal{R}$ with $a(0) = 0$ is denoted
as $\mathrm{Abb}_0(\mathcal{R})$. If $a \in \mathrm{Abb}(\mathcal{R})$, then $a$ induces a map, also denoted by $a$, from $\mathcal{R}(X)$
to $\mathcal{R}(X)$ which is defined as $a(f)(X) = \sum_{j \in \mathbb{Z}} a(f_j) X^j$.

For $k \in \mathbb{N}$ the set $\{0, \dots, k-1\}$ is denoted by $[k]$.




A $k$-substitution transforms a sequence $(f_j)_{j \in \mathbb{Z}}$ into a new sequence by replacing
each element $f_j$ by a string of $k$ elements $\xi_0(f_j)\,\xi_1(f_j) \cdots \xi_{k-1}(f_j)$, where
$\xi_i \in \mathrm{Abb}(\mathcal{R})$. Here follows the formal

Definition 2.1 A $k$-substitution $\xi$ is a $k$-tuple $\xi = (\xi_i)_{i \in [k]} \in (\mathrm{Abb}(\mathcal{R}))^k$ which
defines a map $\xi$ on $\mathcal{R}(X)$ given by

\[ \xi(f) = \sum_{i=0}^{k-1} X^i \, \xi_i(f)(X^k), \]

where $\xi_i(f)(X) = \sum_{j \in \mathbb{Z}} \xi_i(f_j) X^j$. If $\xi \in (\mathrm{Abb}_0(\mathcal{R}))^k$, i.e., when 0 is replaced by a
string of $k$ 0's, then $\xi$ is called a regular $k$-substitution.

Remark 1. A $k$-substitution $\xi = (\xi_i)_{i \in [k]}$ is also defined by a polynomial
$P_\xi(X) = \sum_{i \in [k]} \xi_i X^i \in \mathrm{Abb}(\mathcal{R})[X]$ of degree less than $k$ and we write $\xi(f) = P_\xi(X) \circ f(X^k)$,
where the product is the product induced by the composition of maps and the
coefficients $f_j$ of $f(X)$ are considered as constant maps $r \mapsto f_j$. The polynomial
$P_\xi$ is called the substitution polynomial.

2. If $Q(X) \in \mathcal{R}[X]$ is a polynomial of degree less than or equal to $k - 1$, then
$\xi_Q(f) = Q(X) f(X^k)$ defines a $k$-substitution with $\xi_j(r) = q_j r$.

3. If $\xi_1$ and $\xi_2$ are $k$-substitutions, and if $P_{\xi_1}(X)$ and $P_{\xi_2}(X)$ denote
the respective substitution polynomials, then $\xi = \xi_1 \circ \xi_2$ is a $k^2$-substitution with
substitution polynomial $P_\xi(X) = P_{\xi_1}(X) \circ P_{\xi_2}(X^k)$.

We also need to consider two-dimensional sequences $(g_{i,j})_{i,j \in \mathbb{Z}}$ with $g_{i,j} \in \mathcal{R}$.

The corresponding formal Laurent series representation is

\[ g = g(X,Y) = \sum_{i,j \in \mathbb{Z}} g_{i,j} X^i Y^j . \]

$\mathcal{R}(X,Y)$ is the set of all such two-dimensional Laurent series (sequences). As in
the one-dimensional case, we say that $g(X,Y)$ is a Laurent polynomial if $g_{i,j} = 0$
almost everywhere. As in the one-dimensional case, two-dimensional sequences
can be transformed by a two-dimensional $(k \times k)$-substitution, which replaces each
symbol $g_{i,j}$ by a $(k \times k)$-array of symbols in $\mathcal{R}$. Formally:

Definition 2.2 Let $\Xi = (\xi_{l,m})_{l,m \in [k]} \in (\mathrm{Abb}(\mathcal{R}))^{k \times k}$. The map $\Xi : \mathcal{R}(X,Y) \to
\mathcal{R}(X,Y)$ defined as

\[ \Xi(g) = \sum_{l,m=0}^{k-1} X^l Y^m \, \xi_{l,m}(g)(X^k, Y^k) \]

is called a $(k \times k)$-substitution. It is called regular if $\Xi \in (\mathrm{Abb}_0(\mathcal{R}))^{k \times k}$.

Let  us  agree  here,  that  in  two-dimensional  sequences  and  arrays,  and  in  the 
corresponding  graphical  representations,  the  first  index,  which  is  possibly  associated 
to  the  symbol  X  in  formal  Laurent  series,  increases  along  a  horizontal  axis  which 
is  oriented  from  the  left  to  the  right;  while  the  second  index,  possibly  associated  to 
the  symbol  Y ,  increases  along  a  vertical  axis  which  is  oriented  from  top  to  bottom. 

Remark 1. If $\Xi$ is a regular $(k \times k)$-substitution, then $\Xi(\mathcal{R}_c(X)) \subset \mathcal{R}_c(X)$. In
other words, the set of Laurent polynomials is invariant under regular $k$- or $(k \times k)$-
substitutions.

2. If $\Xi_1$ and $\Xi_2$ are $(k \times k)$-substitutions, and $P_{\Xi_1}$, $P_{\Xi_2}$ the respective
substitution polynomials, then $\Xi_1 \circ \Xi_2$ is a $(k^2 \times k^2)$-substitution with substitution
polynomial $P_{\Xi_1 \circ \Xi_2}(X,Y) = P_{\Xi_1}(X,Y) \circ P_{\Xi_2}(X^k, Y^k)$.

In order to draw the connection between fractals and substitutions we define a
graphical representation of elements in $\mathcal{R}_c(X)$ or in $\mathcal{R}_c(X,Y)$. We only need to
introduce the graphical representation for elements in $\mathcal{R}_c(X,Y)$, since any Laurent
series in $\mathcal{R}(X)$ has a natural counterpart in $\mathcal{R}(X,Y)$.

Let $(\mathcal{H}(\mathbb{R}^2), d_H)$ denote the set of non-empty compact subsets of $\mathbb{R}^2$, where $d_H$
is the Hausdorff distance induced by the Euclidean metric (or any other equivalent
metric) on $\mathbb{R}^2$. Since $\mathbb{R}^2$ is a complete metric space, so is $(\mathcal{H}(\mathbb{R}^2), d_H)$ [5, 6].

Definition 2.3 Let $I(i,j) = [i, i+1] \times [j, j+1]$. Then the map $G : \mathcal{R}_c(X,Y) \to
\mathcal{H}(\mathbb{R}^2) \cup \{\emptyset\}$ defined as

\[ G(g) = \bigcup \{ I(i,j) \mid g_{i,j} \neq 0 \} \]

is called graphical representation.

$I(i,j)$ in the definition above can be considered as a "highlighted pixel" at
location $(i,j)$ on a graphical display which coincides with the $\mathbb{R}^2$-plane (as agreed:
y-axis positively oriented downwards).

Remark 1. $G(g) = \emptyset$ if and only if $g = 0$.

2. Any map $\gamma : \mathcal{R} \setminus \{0\} \to \mathcal{H}(\mathbb{R}^2)$ induces another graphical representation $G_\gamma$ by
defining $G_\gamma(g) = \bigcup \{ (i,j) + \gamma(g_{i,j}) \mid g_{i,j} \neq 0 \}$. This corresponds to "non-square
pixels", where the pixel-shape may depend on $g_{i,j}$.

3. The concept of graphical representation is quite general: when $\mathcal{S}$ is a ring with 0
and 1, then so is $\mathcal{R} = \mathcal{S}^D$, with $0_{\mathcal{R}} = (0, 0, 0, \dots, 0)$ and $1_{\mathcal{R}} = (1, 1, 1, \dots, 1)$ (both
$0_{\mathcal{R}}$ and $1_{\mathcal{R}}$ have $D$ components). So it is possible to speak about the graphical
representation of a two-dimensional sequence with values in $\mathcal{R} = \mathcal{S}^D$, where $g_{i,j}$
is just an element of $\mathcal{S}^D$.

The  connection  between  fractals  and  substitutions  is  provided  by  the  following 
theorem. 

Theorem 2.4 [8] Let $\Xi : \mathcal{R}(X,Y) \to \mathcal{R}(X,Y)$ be a regular $(k \times k)$-substitution such
that for all $r \neq 0$, there are $i, j$ such that $\xi_{i,j}(r) \neq 0$. If $g \in \mathcal{R}_c(X,Y)$ and $g \neq 0$,
then the sequence

\[ \left( \frac{1}{k^n} \, G(\Xi^n(g)) \right)_{n \in \mathbb{N}} \]

is a Cauchy sequence in $(\mathcal{H}(\mathbb{R}^2), d_H)$.

The proof relies on two facts. Firstly, $\Xi$ being regular maps Laurent polynomials on
Laurent polynomials. Thus the above sequence is indeed in $\mathcal{H}(\mathbb{R}^2)$. Secondly, again
due to the regularity of $\Xi$ one has $\frac{1}{k} G(\Xi(g)) \subset G(g)$ for all Laurent polynomials
different from 0. These two observations conclude the proof.

Remark 1. The limit of the above Cauchy sequence is denoted by $\mathcal{A}(g)$.

2. If $G_\gamma$ is another graphical representation, then the above theorem remains true.
Moreover, the limit set does not depend on the graphical representation.

3. For $g \in \mathcal{R}_c(X,Y)$, $g \neq 0$, one has

\[ \mathcal{A}(g) = \bigcup_{\{(i,j) \,\mid\, g_{i,j} \neq 0\}} \mathcal{A}(g_{i,j}) + (i,j), \]

where $\mathcal{A}(g_{i,j}) + (i,j)$ is the translated limit set of the graphical representation of
$g(X,Y) = g_{i,j}$.

Example 1. Let $\mathcal{R} = \mathbb{Z}_2$ and let the 3-substitution be given by the substitution
polynomial $P_\xi(X) = \mathrm{id} + \mathrm{id}\,X^2$, where id denotes the 1 in $\mathrm{Abb}(\mathcal{R})$. The limit
set $\mathcal{A}(1)$, i.e. the limit of the properly rescaled graphical representation of the
sequence $\cdots 0001000 \cdots$ under the substitution $0 \mapsto 000$, $1 \mapsto 101$ is the triadic
Cantor set.

2. Let $\mathcal{R} = \mathbb{Z}_2$ and define the $(2 \times 2)$-substitution by the substitution polynomial
$P_\Xi(X,Y) = \mathrm{id} + \mathrm{id}\,Y + \mathrm{id}\,XY$. The limit set $\mathcal{A}(1)$, i.e. the limit of the properly
rescaled graphical representation of the sequence $g(X,Y) = 1$ (a single value 1 at
$(i,j) = (0,0)$) under the substitution $0 \mapsto \begin{smallmatrix} 0 & 0 \\ 0 & 0 \end{smallmatrix}$, $1 \mapsto \begin{smallmatrix} 1 & 0 \\ 1 & 1 \end{smallmatrix}$, is the Sierpinski triangle.
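The second example can be reproduced numerically with a minimal sketch of a $(k \times k)$-substitution acting on a finite array. The encoding `blocks[r][m][l]` for $\xi_{l,m}(r)$ (with $l$ the column index associated with $X$ and $m$ the row index associated with $Y$), and all function names, are assumptions made for this illustration. The helper `pixels` returns the corners of the rescaled "highlighted pixels", i.e. a finite stage of the sequence in Theorem 2.4.

```python
from math import comb

# Hypothetical encoding: blocks[r][m][l] is xi_{l,m}(r); rows are listed top to bottom.
SIERPINSKI = {0: ((0, 0), (0, 0)), 1: ((1, 0), (1, 1))}


def substitute_2d(grid, blocks, k):
    """Replace every symbol of a finite array by its (k x k) block."""
    rows, cols = len(grid), len(grid[0])
    out = [[0] * (k * cols) for _ in range(k * rows)]
    for j, row in enumerate(grid):
        for i, symbol in enumerate(row):
            for m in range(k):
                for l in range(k):
                    out[k * j + m][k * i + l] = blocks[symbol][m][l]
    return out


def iterate_2d(n, blocks=SIERPINSKI, k=2):
    """Apply the substitution n times to the single symbol 1 at position (0, 0)."""
    grid = [[1]]
    for _ in range(n):
        grid = substitute_2d(grid, blocks, k)
    return grid


def pixels(grid):
    """Upper-left corners (i / size, j / size) of the rescaled non-zero cells."""
    size = len(grid)
    return [(i / size, j / size)
            for j, row in enumerate(grid)
            for i, value in enumerate(row) if value]
```

After $n$ steps the array has $3^n$ non-zero cells, and cell $(i,j)$ is non-zero exactly when the binomial coefficient $\binom{j}{i}$ is odd, which is the familiar Pascal-triangle description of the Sierpinski pattern.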


3  Cellular  automata 

Substitutions  transform  a  sequence  into  another  sequence.  So  do  other  mechanisms 
known  as  cellular  automata.  We  review  briefly  some  essentials  which  we  need 
further. 

Let $\mathcal{S}$ denote a commutative ring with 0 and 1. According to Hedlund [10], a
cellular automaton with states in the ring $\mathcal{S}$ (actually it is only necessary that $\mathcal{S}$
is a finite set) is a continuous map $A : \mathcal{S}(X) \to \mathcal{S}(X)$ which commutes with the
shift map. This implies that every cellular automaton $A$ is given by a local rule
$\phi : \mathcal{S}^{N_1 + N_2 + 1} \to \mathcal{S}$, where $N_1, N_2 \in \mathbb{N}$ and $A$ is defined as

\[ A(f) = \sum_{j \in \mathbb{Z}} \phi(f_{j-N_1}, f_{j-N_1+1}, \dots, f_{j+N_2-1}, f_{j+N_2}) X^j . \]

This means that the $j$-th element of $A(f)$ is obtained from $f$ as a function $\phi$
of the $f$-elements in a fixed finite neighbourhood of $j$. For a cellular automaton
$A : \mathcal{S}(X) \to \mathcal{S}(X)$ and an initial configuration $f \in \mathcal{S}(X)$ we define the orbit $O_A(f)$
of $f$ (w.r.t. $A$) as the two-dimensional sequence defined on $\mathbb{Z} \times \mathbb{N}$, where row $t \in \mathbb{N}$
displays the $t$-th iterate of $A$ on $f$. Formally, and in Laurent series notation:

\[ O_A(f)(X,Y) = \sum_{t=0}^{\infty} A^t(f(X)) \, Y^t . \]

In this note we are dealing with cellular automata $A = A_\phi$ with local rule $\phi : \mathcal{S}^D \to
\mathcal{S}$ such that

\[ A_\phi(f)(X) = \sum_{j \in \mathbb{Z}} \phi(f_{j-D+1}, \dots, f_j) X^j . \]

Of particular interest are linear cellular automata [1], i.e., automata with an $\mathcal{S}$-linear
local rule $\phi : \mathcal{S}^D \to \mathcal{S}$. In this case the cellular automaton is also described by the
multiplication with a "local-rule" polynomial $R(X)$, i.e., $A_\phi(f) = R(X) f(X)$.
We are particularly interested in cellular automata with the additional property
of being Fermat [9]. They provide a framework to explain the existence of fractals
which are generated by cellular automata.

For $k \in \mathbb{N} \setminus \{0\}$ we define the $k$-th power $\pi_k : \mathcal{S}(X) \to \mathcal{S}(X)$ as $\pi_k(f)(X) =
\sum_{j} f_j X^{jk} = f(X^k)$, i.e., $\pi_k$ "inflates" $f$ by inserting $(k-1)$ 0's between the elements
of the sequence $f$ (we could call this "$k$-inflation").

We now recall the concept of a cellular automaton having the $k$-Fermat property.
This implies that row $k$ of the orbit of a "$k$-inflated" initial sequence equals the
"$k$-inflation" of row 1 of the orbit of the "non-inflated" initial sequence, no matter
what the initial sequence might be. Formally:

Definition 3.1 [1, 9] A cellular automaton $A : \mathcal{S}(X) \to \mathcal{S}(X)$ is called $k$-Fermat if
$(A^k \circ \pi_k)(f) = (\pi_k \circ A)(f)$ holds for all $f \in \mathcal{S}(X)$.

Remark 1. If $A$ is a $k$-Fermat cellular automaton, then the sequence 0 is a fixed
point of $A$. Therefore $A(\mathcal{S}_c(X)) \subset \mathcal{S}_c(X)$ for a $k$-Fermat cellular automaton.

2. If $A$ is a linear cellular automaton defined by the Laurent polynomial $R(X)$,
then $A$ is $k$-Fermat if and only if $R(X)^k = R(X^k)$.
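The criterion $R(X)^k = R(X^k)$ of Remark 2 is easy to test by machine. The following is a minimal sketch under simplifying assumptions: the state ring is taken to be $\mathbb{Z}_m$, the local rule is an ordinary polynomial (no negative exponents) stored as a coefficient list, and the function names are hypothetical.

```python
def poly_mul(p, q, m):
    """Product of two coefficient lists over Z_m."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] = (out[i + j] + a * b) % m
    return out


def poly_pow(p, k, m):
    """k-th power of a polynomial over Z_m by repeated multiplication."""
    result = [1]
    for _ in range(k):
        result = poly_mul(result, p, m)
    return result


def inflate(p, k):
    """Coefficient list of R(X^k): insert k - 1 zeros between the coefficients."""
    out = [0] * (k * (len(p) - 1) + 1)
    for j, c in enumerate(p):
        out[k * j] = c
    return out


def is_fermat(p, k, m):
    """True if the linear rule R(X) = sum p[j] X^j over Z_m satisfies R(X)^k = R(X^k)."""
    return poly_pow(p, k, m) == [c % m for c in inflate(p, k)]
```

For instance, `is_fermat([1, 1], 2, 2)` holds because $(1+X)^2 = 1 + X^2$ over $\mathbb{Z}_2$, while `is_fermat([1, 1], 2, 4)` fails because the cross term $2X$ survives over $\mathbb{Z}_4$.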

Example 1. Let $\mathcal{S} = \mathbb{F}_{p^\alpha}$, the field with $p^\alpha$ elements and of characteristic $p$ ($p$
being a prime number). Then any linear cellular automaton is $p^\alpha$-Fermat.

2. Let $\mathcal{S} = \mathbb{Z}_{p^\alpha}$, $p$ a prime number. A linear cellular automaton is $p$-Fermat if its
local rule is a Laurent polynomial $R(X)$ of the form $R(X) = Q(X)^{p^{\alpha-1}}$, where
$Q(X)$ is a Laurent polynomial.

3. Let $\mathcal{S} = \mathbb{Z}_6$, then $R(X) = 3 + 4X$ defines a 2-Fermat linear cellular automaton
and $R(X) = 2 + 4X$ defines a 3-Fermat linear cellular automaton.

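4. Let $\mathcal{S} = \mathbb{Z}_5$. Then the local rule $\phi : \mathbb{Z}_5^2 \to \mathbb{Z}_5$ defined by the table with entries
$\phi(x,y)$

\[
\begin{array}{c|ccccc}
x \backslash y & 0 & 1 & 2 & 3 & 4 \\ \hline
0 & 0 & 1 & 2 & 3 & 4 \\
1 & 1 & 0 & 4 & 2 & 3 \\
2 & 2 & 3 & 0 & 4 & 1 \\
3 & 3 & 4 & 1 & 0 & 2 \\
4 & 4 & 2 & 3 & 1 & 0
\end{array}
\]

induces a (nonlinear) 2-Fermat cellular automaton. In fact, it can be shown that
there are 512 different local rules $\phi : \mathbb{Z}_5^2 \to \mathbb{Z}_5$ which define a 2-Fermat cellular
automaton on $\mathbb{Z}_5(X)$. This demonstrates that not all $k$-Fermat cellular automata
have to be linear cellular automata.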
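The 2-Fermat property of the table rule can be checked on short finitely supported sequences. The sketch below assumes the convention $A(f)_j = \phi(f_{j-1}, f_j)$ used in this note, with the first argument indexing the rows of the table; the list `PHI` transcribes the table, and the function names are hypothetical. A finite list stands for a sequence that vanishes outside positions $0, \dots, n-1$, which is legitimate because $\phi(0,0) = 0$. Passing the check on short sequences is numerical evidence, not a proof.

```python
from itertools import product

# PHI[x][y] = phi(x, y), transcribed from the table (rows: x, columns: y).
PHI = [
    [0, 1, 2, 3, 4],
    [1, 0, 4, 2, 3],
    [2, 3, 0, 4, 1],
    [3, 4, 1, 0, 2],
    [4, 2, 3, 1, 0],
]


def step(seq, phi):
    """One step A(f)_j = phi(f_{j-1}, f_j); the support grows by one cell to the right."""
    padded = [0] + list(seq) + [0]
    return [phi[padded[j]][padded[j + 1]] for j in range(len(padded) - 1)]


def inflate(seq, k):
    """k-inflation: insert k - 1 zeros between consecutive elements."""
    out = [0] * (k * (len(seq) - 1) + 1)
    for j, s in enumerate(seq):
        out[k * j] = s
    return out


def is_2_fermat_on(seq, phi):
    """Compare A^2(pi_2(f)) with pi_2(A(f)) for one finitely supported f."""
    return step(step(inflate(seq, 2), phi), phi) == inflate(step(seq, phi), 2)


def check_all(length, phi=PHI, states=range(5)):
    """Test every sequence of the given length over the state set."""
    return all(is_2_fermat_on(seq, phi) for seq in product(states, repeat=length))
```

The rule is not linear: a linear rule would give $\phi(1,1) = \phi(1,0) + \phi(0,1) = 2$, whereas the table gives $\phi(1,1) = 0$.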

We now present a generic example which illustrates a general property of $k$-Fermat
cellular automata. Consider the cellular automaton with $\mathcal{S} = \mathbb{Z}_3$ and local
rule $R(X) = 1 + 2X + X^2 \in \mathbb{Z}_3[X]$. It is 3-Fermat. Part of the orbit $O_A(1)$ is
shown below:
row 00  ···0001000000000000000000000000000000···
row 01  ···0001210000000000000000000000000000···
row 02  ···0001101100000000000000000000000000···
row 03  ···0001002001000000000000000000000000···
row 04  ···0001212121210000000000000000000000···
row 05  ···0001100000001100000000000000000000···
row 06  ···0001001000001001000000000000000000···
row 07  ···0001211210001211210000000000000000···
row 08  ···0001102201101102201100000000000000···
row 09  ···0001000000002000000001000000000000···
row 10  ···0001210000002120000001210000000000···
row 11  ···0001101100002202200001101100000000···
row 12  ···0001002001002001002001002001000000···
row 13  ···0001212121212121212121212121210000···
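The rows of this orbit can be regenerated by repeated multiplication with $R(X)$ modulo 3. The following sketch is an illustration only; the function name and the list-of-coefficients representation (position 0 is the cell holding the initial 1) are assumptions.

```python
def orbit_rows(rule, modulus, steps):
    """Rows 0..steps of the orbit of the single seed 1 under multiplication by `rule`."""
    rows = [[1]]
    for _ in range(steps):
        prev = rows[-1]
        nxt = [0] * (len(prev) + len(rule) - 1)
        for i, a in enumerate(prev):
            for j, r in enumerate(rule):
                nxt[i + j] = (nxt[i + j] + a * r) % modulus
        rows.append(nxt)
    return rows


def as_text(row):
    """Render a row as a string of digits."""
    return "".join(str(v) for v in row)
```

With `rows = orbit_rows([1, 2, 1], 3, 13)`, the string `as_text(rows[t])` gives the non-trivial part of row $t$; all entries are reduced modulo 3, so only the digits 0, 1 and 2 occur.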


Row $3n + l$, $l \in \{0, 1, 2\}$, in this orbit, i.e. the sequence $A^{3n+l}(1)$, can be obtained
from row $3n$ as $A^l(A^{3n}(1))$. By the 3-Fermat property, row $3n$ is a "3-inflated"
version of row $n$ (say $\cdots z\,a\,b\,c\,d \cdots$), as given by the top sequence in the following
scheme:

\[
\begin{array}{l*{17}{c}}
\text{row } 3n: & \cdots & z & 0 & 0 & a & 0 & 0 & \mathbf{b} & \mathbf{0} & \mathbf{0} & c & 0 & 0 & d & 0 & 0 & \cdots \\
\text{row } 3n+1: & \cdots & z & 2z & z & a & 2a & a & \mathbf{b} & \mathbf{2b} & \mathbf{b} & c & 2c & c & d & 2d & d & \cdots \\
\text{row } 3n+2: & \cdots & & & 0 & a+z & a+z & 0 & \mathbf{a+b} & \mathbf{a+b} & \mathbf{0} & b+c & b+c & 0 & c+d & c+d & 0 & \cdots
\end{array}
\tag{1}
\]

Also represented in this scheme are rows $3n + 1$ and $3n + 2$, obtained by application
of the rule $R(X)$. Observe from this scheme that knowledge of two successive
elements in row $n$, say $(a, b)$, is sufficient to determine all highlighted values (set in boldface). Now
define the substitution which replaces the pair $(a, b)$ by a $(3 \times 3)$-array of pairs,
as follows:

\[
(a, b) \mapsto
\begin{pmatrix}
(0, b) & (b, 0) & (0, 0) \\
(a, b) & (b, 2b) & (2b, b) \\
(0, a+b) & (a+b, a+b) & (a+b, 0)
\end{pmatrix}
\]


The last components of the pairs in this array are given by the elements of the
$(3 \times 3)$-array indicated in the above three sequences (1); the first components are the
elements that immediately precede them. It is now clear that, if we replace each
element (say $b$) in row $n$ by the corresponding last components in row $l$ ($l = 0, 1, 2$)
of the substitution array, we produce row $3n + l$. In that way, the orbit above
corresponds to the last elements of the pairs in the arrays obtained by iterating the
substitution starting from the sequence of pairs $\cdots (0,0)\,(0,1)\,(1,0)\,(0,0) \cdots$ (i.e.,
row 0 rewritten as pairs in which the last components correspond to the elements
of row 0, and the first components to the corresponding preceding elements). We
show the first step of the iteration (the elements of the actual orbit are the last components of the pairs):
\[ \cdots \quad (0,0) \quad (0,1) \quad (1,0) \quad (0,0) \quad \cdots \]

\[ \downarrow \]

\[
\begin{array}{c|c|c|c}
(0,0)(0,0)(0,0) & (0,1)(1,0)(0,0) & (0,0)(0,0)(0,0) & (0,0)(0,0)(0,0) \\
(0,0)(0,0)(0,0) & (0,1)(1,2)(2,1) & (1,0)(0,0)(0,0) & (0,0)(0,0)(0,0) \\
(0,0)(0,0)(0,0) & (0,1)(1,1)(1,0) & (0,1)(1,1)(1,0) & (0,0)(0,0)(0,0)
\end{array}
\]


In the next step, each pair is again subjected to the substitution $\Psi$. In general, the
pairs will be $D$-tuples, and the substitution will replace $D$-tuples by $(k \times k)$-arrays
of $D$-tuples (for a $k$-Fermat cellular automaton). We now express this property as
a formal theorem.

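Let $D$ be a natural number, $D > 1$. We define an embedding $\iota = \iota_D$ of $\mathcal{S}(X)$ into
$\mathcal{S}^D(X)$ as follows ($\mathcal{S}^D(X)$ can be considered as the set of sequences with $D$-tuples
as values). For $f(X) = \sum_{j \in \mathbb{Z}} f_j X^j \in \mathcal{S}(X)$ we define $\iota(f) \in \mathcal{S}^D(X)$ as

\[ \iota(f)(X) = \sum_{j \in \mathbb{Z}} (f_{j-D+1}, \dots, f_j) X^j . \]

We also define a projection $p = p_D : \mathcal{S}^D(X) \to \mathcal{S}(X)$ such that $p \circ \iota(f) = f$ by
setting

\[ p\Big( \sum_{j \in \mathbb{Z}} (s_{j,-D+1}, s_{j,-D+2}, \dots, s_{j,0}) X^j \Big) = \sum_{j \in \mathbb{Z}} s_{j,0} X^j . \]

With these notions the work of several authors [7, 8, 9, 13, 14] on fractals generated
by cellular automata can be summarized as

Theorem 3.2 If $A : \mathcal{S}(X) \to \mathcal{S}(X)$ is a $k$-Fermat cellular automaton, then there
exists a $D \in \mathbb{N}$ and a regular $(k \times k)$-substitution $\Psi = (\psi_{i,j})_{i,j \in [k]} : \mathcal{S}^D(X,Y) \to
\mathcal{S}^D(X,Y)$ such that for all $n \in \mathbb{N}$ and $l \in [k]$ we have

\[ A^{nk+l}(1) = p(\sigma_l(\iota(A^n(1)))), \]

where $\sigma_l$ is the $k$-substitution $(\psi_{i,l})_{i \in [k]}$.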

The interpretation in terms of fractals generated by cellular automata is clear.
If $A$ is a $k$-Fermat cellular automaton, then the initial configuration 1 generates
an orbit. If the orbit is visualized by a graphical representation, one observes a
certain pattern. According to Theorem 3.2, the pattern can be described by a
$(k \times k)$-substitution $\Psi$ and by Theorem 2.4 the pattern is represented by a
compact subset. Moreover, for linear $k$-Fermat cellular automata it is known that
the limit set does not depend on the initial configuration $f \in \mathcal{S}_c(X)$.
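For the 3-Fermat example of this section ($\mathcal{S} = \mathbb{Z}_3$, $R(X) = 1 + 2X + X^2$, $D = 2$), the statement of Theorem 3.2 can be checked numerically: iterating the $(3 \times 3)$-substitution on pairs and projecting onto the last components should reproduce the orbit computed directly. The sketch below is an illustration under these assumptions; the function names are hypothetical, and the initial row is cut to three pairs, which is enough to cover the support of the rows generated.

```python
def psi(a, b):
    """(3 x 3)-block of pairs replacing (a, b); rows listed top to bottom, states in Z_3."""
    s = (a + b) % 3
    t = (2 * b) % 3
    return [
        [(0, b), (b, 0), (0, 0)],
        [(a, b), (b, t), (t, b)],
        [(0, s), (s, s), (s, 0)],
    ]


def substitute(grid):
    """Apply the pair substitution to every entry of a finite array of pairs."""
    out = []
    for row in grid:
        blocks = [psi(a, b) for (a, b) in row]
        for m in range(3):
            out.append([pair for block in blocks for pair in block[m]])
    return out


def direct_orbit(steps, width):
    """Rows 0..steps of the orbit of 1 under R(X) = 1 + 2X + X^2 over Z_3, cut to `width` cells."""
    row = [1] + [0] * (width - 1)
    rows = [row]
    for _ in range(steps):
        padded = [0, 0] + row
        row = [(padded[j + 2] + 2 * padded[j + 1] + padded[j]) % 3 for j in range(width)]
        rows.append(row)
    return rows


def project(grid):
    """Projection p: keep the last component of every pair."""
    return [[pair[1] for pair in row] for row in grid]
```

Starting from `[[(0, 1), (1, 0), (0, 0)]]` (row 0 written as pairs), two applications of `substitute` give a $9 \times 27$ array whose projection coincides with rows 0 to 8 of the orbit.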


4  Cellular  automata,  symmetries  and  invariants 

It is worthwhile to study cellular automata with a local rule $\phi : \mathcal{S}^2 \to \mathcal{S}$ which
exhibits certain symmetries [3, 4].

Definition 4.1 The map $\phi : \mathcal{S}^2 \to \mathcal{S}$ is called rotationally symmetric if
$\phi(\phi(x,y), x) = y$ holds for all $x, y \in \mathcal{S}$. If $\phi$ is rotationally symmetric and commutative,
i.e., $\phi(x,y) = \phi(y,x)$, then $\phi$ is called totally symmetric.

The following geometric interpretation clarifies the definition. We represent the relation
$z = \phi(x,y)$ as a triangular array $\begin{smallmatrix} x & & y \\ & z & \end{smallmatrix}$, and call it an elementary $\phi$-configuration.
In geometric terms the rotational symmetry of $\phi$ can be phrased as: Any rotation
(by $\pm 2\pi/3$) of an elementary $\phi$-configuration, changing $\begin{smallmatrix} x & & y \\ & z & \end{smallmatrix}$ into $\begin{smallmatrix} z & & x \\ & y & \end{smallmatrix}$ or $\begin{smallmatrix} y & & z \\ & x & \end{smallmatrix}$, is an elementary
$\phi$-configuration.

The commutativity of $\phi$ implies the invariance of elementary $\phi$-configurations
under vertical reflection. Combined with the rotational symmetry of $\phi$ we can state
that $\phi$ is totally symmetric if and only if the set of elementary $\phi$-configurations is
invariant under the symmetry group of the equilateral triangle.

Example 1. The local rule $\phi : \mathbb{Z}_2^2 \to \mathbb{Z}_2$ defined as $\phi(x,y) = x + y$ is totally
symmetric.

2. Let $\mathbb{F}_4 = \{0, 1, \zeta, 1 + \zeta\}$ denote the field with 4 elements. The local rule $\phi : \mathbb{F}_4^2 \to
\mathbb{F}_4$ defined as $\phi(x,y) = \zeta x + (1 + \zeta) y$ is rotationally symmetric but not totally
symmetric.

3. There exist rotationally symmetric local rules which are not related to linear
rules [4].
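Definition 4.1 can be verified exhaustively for the first two examples. In the sketch below the elements of $\mathbb{F}_4$ are encoded as $0, 1, 2 = \zeta, 3 = 1 + \zeta$, addition is bitwise XOR, and multiplication is given by a small table built from $\zeta^2 = 1 + \zeta$; this encoding and the function names are assumptions made for the illustration.

```python
# Multiplication table of F_4 with 2 = zeta, 3 = 1 + zeta (zeta^2 = 1 + zeta).
F4_MUL = [
    [0, 0, 0, 0],
    [0, 1, 2, 3],
    [0, 2, 3, 1],
    [0, 3, 1, 2],
]


def phi_z2(x, y):
    """phi(x, y) = x + y over Z_2."""
    return (x + y) % 2


def phi_f4(x, y):
    """phi(x, y) = zeta * x + (1 + zeta) * y over F_4 (addition is XOR)."""
    return F4_MUL[2][x] ^ F4_MUL[3][y]


def is_rotationally_symmetric(phi, states):
    return all(phi(phi(x, y), x) == y for x in states for y in states)


def is_commutative(phi, states):
    return all(phi(x, y) == phi(y, x) for x in states for y in states)


def is_totally_symmetric(phi, states):
    return is_rotationally_symmetric(phi, states) and is_commutative(phi, states)
```

For the $\mathbb{F}_4$ rule, $\phi(1,0) = \zeta$ while $\phi(0,1) = 1 + \zeta$, so commutativity fails although the rotational identity holds for all sixteen pairs.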

Definition 4.2 A $\phi$-configuration of size $N$ is a top down equilateral triangular
array with $N$ elements in the top row and with values in $\mathcal{S}$ such that any subtriangle
$\begin{smallmatrix} x & & y \\ & z & \end{smallmatrix}$ is an elementary $\phi$-triangle.

A $\phi$-configuration of size $N$ is rotationally symmetric if the rotated $\phi$-configuration
is equal to the $\phi$-configuration.

A $\phi$-configuration of size $N$ is totally symmetric if it remains unchanged under
the symmetry group of the equilateral triangle.

Figure 1 shows examples of $\phi$-configurations.


Figure 1. Examples of $\phi$-configurations of size 21 for $\phi(x,y) = x + y$ defined in $\mathbb{Z}_2$.
(a): a nonsymmetric one, (b): a rotationally symmetric one, (c): a totally symmetric one.
(A black cell represents the value 1, a white cell the value 0).


Theorem 4.3 [4] If $\phi : \mathcal{S}^2 \to \mathcal{S}$ is rotationally symmetric and of the form $\phi(x,y) =
rx + sy$, then there exists for any $N \in \mathbb{N}$ a rotationally symmetric $\phi$-configuration
of size $N$.

If $\phi : \mathcal{S}^2 \to \mathcal{S}$ is totally symmetric and of the form $\phi(x,y) = rx + sy$, then there
exists for any $N \in \mathbb{N}$ a totally symmetric $\phi$-configuration of size $N$.
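For small sizes, rotationally symmetric $\phi$-configurations can be found by exhaustive search. The sketch below builds the triangle from its top row and uses the following observation, which is an assumption of this illustration rather than a statement from the text: when $\phi$ is rotationally symmetric, the rotated triangle is again a $\phi$-configuration whose top row is the left edge of the original read from bottom to top, so the configuration is rotationally symmetric exactly when that edge equals the top row. Function names are hypothetical.

```python
from itertools import product


def phi_z2(x, y):
    """phi(x, y) = x + y over Z_2 (a rotationally symmetric rule)."""
    return (x + y) % 2


def configuration(top, phi):
    """Rows of the triangular array generated from the top row."""
    rows = [list(top)]
    while len(rows[-1]) > 1:
        prev = rows[-1]
        rows.append([phi(prev[i], prev[i + 1]) for i in range(len(prev) - 1)])
    return rows


def is_rotationally_symmetric_configuration(rows):
    """Compare the top row with the left edge read from bottom to top."""
    n = len(rows)
    left_edge_upwards = [rows[n - 1 - i][0] for i in range(n)]
    return left_edge_upwards == rows[0]


def symmetric_top_rows(size, phi, states):
    """All top rows of the given size that generate a rotationally symmetric configuration."""
    return [top for top in product(states, repeat=size)
            if is_rotationally_symmetric_configuration(configuration(top, phi))]
```

For $\phi(x,y) = x + y$ over $\mathbb{Z}_2$ and size 3, the search returns the zero row and the row 010, whose triangle has 1's at the three edge midpoints and 0's at the corners.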
As a next step we introduce invariants of cellular automata. The motivating
example for invariants is given by the $k$-Fermat cellular automata. Note that the
map $\pi_k : \mathcal{S}(X) \to \mathcal{S}(X)$ is a special kind of $k$-substitution. In fact, the substitution
polynomial associated with $\pi_k$ is $P_{\pi_k}(X) = \mathrm{id}$ and $A^k \circ \pi_k = \pi_k \circ A$. It is therefore
natural to define

Definition 4.4 Let $A : \mathcal{S}(X) \to \mathcal{S}(X)$ be a cellular automaton. A $k$-substitution $\xi$ is called
$k$-invariant of $A$ if $A^k \circ \xi = \xi \circ A$. The set of all $k$-invariants of the cellular automaton
$A$ is denoted by $\mathrm{Inv}_k(A)$. The substitution $\xi$ is called a regular $k$-invariant of $A$ if $\xi$
is a $k$-invariant and regular. The set of regular $k$-invariants is denoted by $\mathrm{Inv}_k^0(A)$.

Remark If $k = 1$, then the 1-invariants of a cellular automaton $A$ with local
rule $\phi : \mathcal{S}^D \to \mathcal{S}$ are given by

\[ \mathrm{Inv}_1(A) = \{ \xi : \mathcal{S} \to \mathcal{S} \mid \phi(\xi(r_1), \dots, \xi(r_D)) = \xi(\phi(r_1, \dots, r_D)) \text{ for all } r_1, \dots, r_D \in \mathcal{S} \} . \]

For linear $k$-Fermat cellular automata the coefficients of the substitution polynomial
of a $k$-invariant $\xi$ can be characterized via the local rule.

Theorem 4.5 If $A$ is a linear $k$-Fermat cellular automaton, then $\mathrm{Inv}_k(A) =
\prod_{i=1}^{k} \mathrm{Inv}_1(A)$.

Proof. Let $f \in \mathcal{S}(X)$ and let $R(X) = \sum_{j=0}^{D} r_j X^j$ be the local rule of $A$. Since $\xi$
is a $k$-invariant, we have that $R(X)^k \xi(f) = \xi(R(X) f(X))$. Since $R$ is $k$-Fermat,
the left side becomes $R(X^k)\xi(f) = R(X^k) \sum_{j=0}^{k-1} X^j \xi_j(f)(X^k)$ (using the definition
of $\xi$). Thus the coefficient of $X^{kl+j}$, $j \in [k]$, in $R(X^k)\xi(f)$ can be computed as
$r_0 \xi_j(f_l) + r_1 \xi_j(f_{l-1}) + \cdots + r_D \xi_j(f_{l-D})$. On the other hand, the same coefficient
in $\xi(R(X) f(X))$ turns out to be $\xi_j(r_0 f_l + \cdots + r_D f_{l-D})$, i.e. $\xi_j \circ A = A \circ \xi_j$.
Therefore, $\xi$ is a $k$-invariant if and only if each $\xi_j$ is a 1-invariant of $A$. $\bullet$

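Example 1. For $\mathcal{S} = \mathbb{Z}_2$ and $R(X) = 1 + X$, we have $\mathrm{Inv}_1(A) = \{ s \mapsto s,\ s \mapsto 0 \}$
($s \in \mathcal{S}$) and $\mathrm{Inv}_1(A) = \mathrm{Inv}_1^0(A)$. The automaton specified by this rule is $2^n$-Fermat
(for any $n \in \mathbb{N}$). Thus $\mathrm{Inv}_2^0(A)$ contains $2^2$ regular 2-invariant substitutions:
$0 \mapsto 00$, $1 \mapsto 00$; $0 \mapsto 00$, $1 \mapsto 10$; $0 \mapsto 00$, $1 \mapsto 01$ and $0 \mapsto 00$, $1 \mapsto 11$. $\mathrm{Inv}_4^0(A)$
contains $2^4$ regular 4-invariants, among which $0 \mapsto 0000$, $1 \mapsto 1011$.

2. For $\mathcal{S} = \mathbb{Z}_3$ and $R(X) = 2 + 2X$, we have $\mathrm{Inv}_1(A) = \{ s \mapsto \alpha s + \beta \mid \alpha, \beta \in \mathbb{Z}_3 \}$
and $\mathrm{Inv}_1^0(A) = \{ s \mapsto \alpha s \mid \alpha \in \mathbb{Z}_3 \}$. $\mathrm{Inv}_3^0(A)$ contains $3^3$ regular 3-invariants,
among which $0 \mapsto 000$, $1 \mapsto 201$, $2 \mapsto 102$.

3. $R(X) = 1 + 3X \in \mathbb{Z}_6[X]$ defines a 2-Fermat automaton such that $|\mathrm{Inv}_1(A)| = 54$
and $|\mathrm{Inv}_1^0(A)| = 18$. Thus there are $18^2$ regular 2-invariants.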
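The cardinalities quoted in these examples can be recomputed by brute force. The sketch below enumerates all maps $\xi : \mathbb{Z}_m \to \mathbb{Z}_m$ and keeps those that commute with the local rule of $R(X) = r_0 + r_1 X$, i.e. $\phi(x,y) = r_1 x + r_0 y$ with $x = f_{j-1}$ and $y = f_j$. The function names and the tuple encoding of a map (entry $s$ holds $\xi(s)$) are assumptions of this illustration.

```python
from itertools import product


def one_invariants(r0, r1, m):
    """All maps xi on Z_m with phi(xi(x), xi(y)) = xi(phi(x, y)) for R(X) = r0 + r1*X."""
    def phi(x, y):
        return (r1 * x + r0 * y) % m

    found = []
    for xi in product(range(m), repeat=m):  # xi[s] is the image of s
        if all(phi(xi[x], xi[y]) == xi[phi(x, y)]
               for x in range(m) for y in range(m)):
            found.append(xi)
    return found


def regular(invariants):
    """Keep the maps with xi(0) = 0."""
    return [xi for xi in invariants if xi[0] == 0]
```

By Theorem 4.5 the number of regular $k$-invariants is the $k$-th power of the number of regular 1-invariants, for example $18^2 = 324$ regular 2-invariants for $R(X) = 1 + 3X$ over $\mathbb{Z}_6$.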

In analogy with the substitution $\Psi$ introduced before Theorem 3.2, we associate
here with a linear $k$-Fermat automaton $A$, given by $R(X) = s + rX$, and a $k$-invariant
$\xi \in \mathrm{Inv}_k(A)$, a $(k \times k)$-substitution $\Psi = \Psi(A, \xi) : \mathcal{S}^2(X,Y) \to \mathcal{S}^2(X,Y)$.
Let $(a, b) \in \mathcal{S}^2$ and consider the polynomials $Q_m(X) = R(X)^m \xi(aX^{-1} + b) =
\sum_{i \in \mathbb{Z}} q_{i,m} X^i$ for $m \in [k]$. For $m \in [k]$ the coefficients $q_{-1,m}, q_{0,m}, \dots, q_{k-1,m}$ of
$Q_m(X)$ are uniquely determined by $a$ and $b$, thus these coefficients can be regarded
as functions from $\mathcal{S}^2$ to $\mathcal{S}$. Therefore, the substitution $\Psi = (\psi_{i,m})_{i,m \in [k]}$, defined
as

\[ \psi_{i,m} : \mathcal{S}^2 \to \mathcal{S}^2, \qquad (a, b) \mapsto (q_{i-1,m}(a,b),\ q_{i,m}(a,b)) \]

for $i, m \in [k]$, is well defined and called the induced substitution.

Remark 1. If $\xi$ is a regular $k$-invariant, then the induced substitution $\Psi$ is a
regular substitution.

2. If the local rule of a linear cellular automaton is given by the polynomial $R(X) \in
\mathcal{S}[X]$, of degree $d$, and if $\xi$ is a regular $k$-invariant, then the induced substitution
is defined by maps from $\mathcal{S}^{d+1}$ which are constructed in a similar manner as for the
case $d = 1$ considered above. $(d + 1)$-tuples are then replaced by a $(k \times k)$-array
of $(d + 1)$-tuples.

3. If $A_\phi$ is any cellular automaton such that


and with a $k$-invariant $\xi$, then the above construction yields an induced substitution
as well.

Example For $\mathcal{S} = \mathbb{Z}_2$, the polynomial $R(X) = 1 + X$ defines a $2^2$-Fermat linear
cellular automaton $A$. By Example 1 following Theorem 4.5, we have that the
substitution $\xi$ defined by the polynomial $P_\xi(X) = \mathrm{id} + 0X + \mathrm{id}\,X^2 + \mathrm{id}\,X^3$ defines a
4-invariant of $A$. Consider a sequence $\cdots z\,a\,b\,c\,d \cdots$, then substitution $\xi$ transforms
this into the top row shown below:

\[
\begin{array}{*{18}{c}}
\cdots & z & 0 & z & z & a & 0 & a & a & b & 0 & b & b & c & 0 & c & c & \cdots \\
\cdots &   & z & z & 0 & z+a & a & a & 0 & a+b & b & b & 0 & b+c & c & c & 0 & \cdots \\
\cdots &   &   & 0 & z & z+a & z & 0 & a & a+b & a & 0 & b & b+c & b & 0 & c & \cdots \\
\cdots &   &   &   & z & a & a & z & a & b & b & a & b & c & c & b & c & \cdots
\end{array}
\]

The other rows are part of the orbit generated with the given cellular automaton
rule starting from the first row. The induced substitution $\Psi$ can be read directly
from these rows:

\[
(a, b) \mapsto
\begin{pmatrix}
(a, b) & (b, 0) & (0, b) & (b, b) \\
(0, a+b) & (a+b, b) & (b, b) & (b, 0) \\
(a, a+b) & (a+b, a) & (a, 0) & (0, b) \\
(a, b) & (b, b) & (b, a) & (a, b)
\end{pmatrix}
\tag{2}
\]
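The array (2) can be recomputed from the definition of the induced substitution, $Q_m(X) = R(X)^m \xi(aX^{-1} + b)$. The sketch below does this for $\mathcal{S} = \mathbb{Z}_2$, $R(X) = 1 + X$ and the 4-invariant $0 \mapsto 0000$, $1 \mapsto 1011$, and compares the result with the entries of (2). The function names and the list layout (index $p + 4$ holds the coefficient at position $p$, for $p = -4, \dots, 3$) are assumptions of this illustration.

```python
XI = {0: (0, 0, 0, 0), 1: (1, 0, 1, 1)}  # the 4-invariant 0 -> 0000, 1 -> 1011
K = 4


def induced_block(a, b):
    """Rows m = 0..3 of the induced substitution, computed from Q_m = R^m * xi(a X^-1 + b)."""
    row = list(XI[a]) + list(XI[b])  # coefficients at positions -4 .. 3
    block = []
    for _ in range(K):
        # psi_{i,m}(a, b) = (q_{i-1,m}, q_{i,m}) for i = 0..3
        block.append([(row[K + i - 1], row[K + i]) for i in range(K)])
        # multiply by R(X) = 1 + X over Z_2 (nothing lies to the left of position -4)
        row = [(row[j] + (row[j - 1] if j > 0 else 0)) % 2 for j in range(len(row))]
    return block


def expected_block(a, b):
    """The array (2), with sums taken in Z_2."""
    s = (a + b) % 2
    return [
        [(a, b), (b, 0), (0, b), (b, b)],
        [(0, s), (s, b), (b, b), (b, 0)],
        [(a, s), (s, a), (a, 0), (0, b)],
        [(a, b), (b, b), (b, a), (a, b)],
    ]
```

Comparing `induced_block(a, b)` with `expected_block(a, b)` for the four pairs in $\mathbb{Z}_2^2$ is a quick consistency check of the array.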


The following theorem is a straightforward generalization of Theorem 3.2. It
exploits the existence of $k$-invariants instead of the $k$-Fermat property.

Theorem 4.6 Let $A_\phi$ be a cellular automaton and let $\xi$ be a regular $k$-invariant and
let $\Psi = (\psi_{i,j})_{i,j \in [k]}$ be the induced substitution. There exists a natural number
$D$, and embedding $\iota = \iota_D$, such that for $f \in \mathcal{S}(X)$ with $\xi(f) = f$ and all $n \in \mathbb{N}$,
$l \in [k]$ we have

\[ A^{nk+l}(f) = p(\sigma_l(\iota(A^n(f)))), \]

where $\sigma_l$ is the $k$-substitution $(\psi_{i,l})_{i \in [k]}$.

Proof. Since $\xi$ is a $k$-invariant and $f = \xi(f)$, we have $A^{kn}(f) = A^{kn}(\xi(f)) =
\xi(A^n(f))$ for all $n \in \mathbb{N}$. That proves the assertion for $l = 0$. The assertion for $l \neq 0$
follows from the definition of the induced substitution. $\bullet$

We illustrate this theorem by continuing the above example, with $\mathcal{S} = \mathbb{Z}_2$,
$R(X) = 1 + X$ and the 4-invariant $\xi$ which produces the following substitutions:
$0 \mapsto 0000$, $1 \mapsto 1011$. Take $f$ as the sequence which is the limit obtained from
applying this substitution to $\cdots 000\,\mathbf{1}\,011000 \cdots$ (the boldface element is at position
0). This yields the top row in

f      ···0001011000010111011000000000000000010111011···

A(f)   ···0001110100011100110100000000000000011100110···

Observe that $f = \xi(f)$. Theorem 4.6 states that, for example, row 6 ($= 1 \cdot 4 + 2$)
in the orbit $O_A(f)$ can be obtained from $A(f)$ displayed above, by embedding this
last sequence in a sequence of pairs (corresponding to $D = 2$), as given by

··· (0,1) (1,1) (1,1) (1,0) (0,1) (1,0) (0,0) (0,0) (0,1) (1,1) (1,1) ···

(the last components in these pairs are the elements of $A(f)$, the
first components are the ones that immediately precede them). Then apply the
substitutions displayed in row 2 of the induced substitution array given above
(see (2): row 2 is actually the third row, as counting starts at zero). This gives

··· (0,1)(1,0)(0,0)(0,1) | (1,0)(0,1)(1,0)(0,1) | (1,0)(0,1)(1,0)(0,1) | (1,1)(1,1) ···

The last components in these pairs form row 6 in $O_A(f)$. In a similar way, using row
$l$ of the $\Psi$-substitution array, where $l \in \{0, 1, 2, 3\}$, starting from the embedding
of row $A^n(f)$ in a sequence of pairs, would produce row $4 \cdot n + l$ in this orbit.

As a consequence of Theorem 2.4, we obtain

Corollary 4.7 If $\xi$ is a regular $k$-invariant of the cellular automaton $A$ and if $\Psi$
is the induced substitution, then the sequence $\left( \frac{1}{k^n} G(\Psi^n(\iota(f))) \right)_{n \in \mathbb{N}}$ is a Cauchy
sequence for all $f \in \mathcal{S}_c(X)$.

As a final step, we combine the notion of symmetry of a cellular automaton with
the existence of induced substitutions. We are interested in the following problem.
Suppose $f_0, \dots, f_{N-1}$ is the top row of a $\phi$-configuration of size $N$ and assume that
the configuration is, e.g., rotationally symmetric. If $\xi$ is a regular $k$-invariant of
$A_\phi$, then $\xi(\sum_j f_j X^j)$ gives another top row of a $\phi$-configuration of size $kN$; what are
the symmetry properties of the larger configuration?

Definition 4.8 Let $A_\phi$ be a cellular automaton with local rule $\phi : \mathcal{S}^2 \to \mathcal{S}$ and let
the substitution $\xi = (\xi_i)_{i \in [k]}$ be a $k$-invariant of $A$. The $k$-invariant is palindromic
if $\xi_i = \xi_{k-1-i}$ holds for all $i \in [k]$.

The $k$-invariant $\xi$ is called rotationally symmetric if the induced substitution
$\Psi = (\psi_{n,m})$ satisfies

\[ p(\psi_{i,0}(a,b)) = p(\psi_{k-1,i}(a,b)) \]

for all $(a, b) \in \mathcal{S}^2$ and all $i \in [k]$.

If $\xi \in \mathrm{Inv}_k^0(A)$ is palindromic and rotationally symmetric, then $\xi$ is called totally
symmetric.

An example of a rotationally symmetric 4-invariant is given by the example preceding
Theorem 4.6: the last components of the rightmost column in the $\Psi$-array
equal the last components of its top row.
Remark If $\xi = (\xi_i)$ is a $k$-invariant for $A_\phi$ and $\Psi$ the induced substitution,
then we have $p(\psi_{i,0}(a,b)) = \xi_i(b)$ for $i \in [k]$ and there exist functions
$\rho_i \in \mathrm{Abb}(\mathcal{S})$, $i \in [k]$, such that $p(\psi_{k-1,i}(a,b)) = \rho_i(b)$. In particular, if $\xi$ is rotationally
symmetric, then we have $\rho_i = \xi_i$ for all $i \in [k]$.

Lemma 4.9 [4] Let $A$ be a cellular automaton with local rule $\phi : \mathcal{S}^2 \to \mathcal{S}$. If
$f(X) \in \mathcal{S}[X]$ is of degree $\leq N$, then $f(X)$ defines a rotationally symmetric $\phi$-configuration
of size $(N + 1)$ if and only if for the coefficients $O_{i,j}$ of $O_A(f)(X,Y)$
the equations $O_{i,0} = O_{N,i}$ hold for $i = 0, \dots, N$.

In the linear case a carefully performed substitution can preserve symmetries.

Lemma 4.10 Let $A_\phi$ be a cellular automaton with rotationally symmetric local
rule $\phi(x,y) = rx + sy$. If $f_0, \dots, f_{N-1}$ is a top row of a rotationally symmetric
$\phi$-configuration of size $N$ and if $\xi \in \mathrm{Inv}_k^0(A_\phi)$ is rotationally symmetric, then
$g_0, \dots, g_{kN-1}$, where $g(X) = \sum_{j \in \mathbb{Z}} g_j X^j = \xi(\sum_{j=0}^{N-1} f_j X^j)$, is the top row of a
rotationally symmetric $\phi$-configuration of size $kN$.

Proof. Let $f(X) = \sum_{j=0}^{N-1} f_j X^j$ and $O = O_A(f)(X,Y) = \sum O_{i,j} X^i Y^j$ its orbit. If
$\Psi$ denotes the induced $(k \times k)$-substitution, then $\Psi(O) = \sum \Theta_{i,j} X^i Y^j$ is the orbit
of $\xi(f)$. It remains to prove that $\Theta_{i,0} = \Theta_{kN-1,i}$ holds for all $i = 0, \dots, kN - 1$.

By Lemma 4.9, we have $O_{i,0} = O_{N-1,i}$ for $i = 0, \dots, N - 1$. Since $\xi$ is rotationally
symmetric, we have $\Theta_{kj+l,0} = \xi_l(O_{j,0})$ for $kj + l \in [Nk]$ and $l \in [k]$ and
$\Theta_{kN-1,kj+l} = \xi_l(O_{N-1,j}) = \xi_l(O_{j,0})$, which proves the assertion. $\bullet$

Lemma  4.10  remains  true  if  rotationally  symmetric  is  replaced  by  totally  symmetric. 

We  can  now  establish  the  existence  of  fractals  with  prescribed  symmetries. 

Theorem 4.11 Let $\mathcal{A}_\phi$, with $\phi(x,y) = rx + sy$, define a cellular automaton with a rotationally (totally) symmetric linear local rule. Let $f(X) \in S[X]$ of degree $\le N$ define a rotationally (totally) symmetric $\phi$-configuration of size $(N+1)$ and let $\xi \in \mathrm{Inv}_k(\mathcal{A}_\phi)$ be a regular rotationally (totally) symmetric $k$-invariant. If $\iota(O_{\mathcal{A}}(f)) = \iota_2(O_{\mathcal{A}}(f))$ denotes the embedding of the orbit $O_{\mathcal{A}}(f)$ which replaces $(O_{\mathcal{A}}(f))_{j,i}$ by $\big((O_{\mathcal{A}}(f))_{j-1,i}, (O_{\mathcal{A}}(f))_{j,i}\big)$, and if $\Xi$ denotes the induced substitution, then the sequence

where $V$ is the equilateral triangle given by the points $(0,0)$, $(1,0)$ and $(1/2, \sqrt{3}/2)$ (ordinate axis positively oriented downwards), is a Cauchy sequence. The limit, denoted as $X = X(f, \xi)$, is a compact set in $V$ and $X$ is rotationally (totally) symmetric.

Proof. By Theorem 2.4, the limit of $\frac{1}{k^n} G(\Xi^n(\iota(f)))$ exists and is contained in $[0, N+1]^2$. The factor transforms the triangle $\{(x,y) \in [0, N+1]^2 \mid y \le x\}$ into an equilateral triangle of size 1. Since the limit is independent of the particular graphical representation ("pixel-shape"), we can define $\gamma : S^2 \setminus \{0\} \to H(\mathbb{R}^2)$ by setting $\gamma(s_{-1}, s_0) = \bigcup \{(i,0) \mid s_j \ne 0,\ j = -1, 0\}$. Then the rescaled version of $G_\gamma(\Xi^n(\iota(f))) \cap V$ is rotationally (totally) symmetric for each $n \in \mathbb{N}$. Thus the limit is rotationally (totally) symmetric. •


Figure 2. Symmetric fractals generated by cellular automata. (a) and (b): states in $\mathbb{Z}_2 = \{0,1\}$ with local rule $\phi(x,y) = x + y$ and, for (a): with top row generated from the initial top-row configuration 1101101101 and the 16-substitution 0 ↦ 0000000000000000, 1 ↦ 0110011000000110; for (b): initial configuration 0110110110 and 8-substitution 0 ↦ 00000000, 1 ↦ 01111110. (c): states in $\mathbb{F}_4 = \{0, 1, \zeta, 1+\zeta\} = \{0,1,2,3\}$ with local rule $\phi(x,y) = \zeta x + (1+\zeta)y$, initial state 0130 and 4-substitution 0 ↦ 0000, 1 ↦ 1021, 2 ↦ 2032, 3 ↦ 3013. (d): states in $\mathbb{Z}_3 = \{0,1,2\}$ with local rule $\phi(x,y) = 2x + 2y$, initial configuration 11011011 and 9-substitution 0 ↦ 000000000, 1 ↦ 002020200, 2 ↦ 001010100. (a) and (c) are rotationally symmetric, (b) and (d) are totally symmetric. The Hausdorff dimensions are respectively log 117 / log 16 ≈ 1.717, log 36 / log 8 ≈ 1.723, log 13 / log 4 ≈ 1.850, log 54 / log 9 ≈ 1.815.


Figure 2 shows a few examples of symmetric fractal limits obtained from rescaled
cellular automata orbits generated by a proper initial top-row configuration
subjected to a proper substitution as presented in the above theorem.
5 Conclusions

The method described above allows one to construct fractals which reflect the local rule of a cellular automaton and also show a global symmetry. Since these fractals are generated by a $(k \times k)$-substitution, they can also be described in terms of hierarchical iterated function systems, in particular by so-called $k$-adic HIFS.8 By introducing a transition matrix8 for the generating substitution it is possible to compute the fractal's box-counting dimension,5,6 which coincides with the Hausdorff dimension.
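The dimensions quoted in the caption of Figure 2 all have the form log(growth) / log(k). The following sketch simply re-evaluates them. Reading the numerator as the per-step growth factor of non-zero cells (for instance the dominant eigenvalue of the transition matrix) is an assumption made here for illustration; the names `similarity_dimension` and `FIGURE_2` are hypothetical.

```python
import math

def similarity_dimension(growth, k):
    """log(growth) / log(k) for a k-substitution whose cell count grows by `growth` per step."""
    return math.log(growth) / math.log(k)

# (assumed growth factor, substitution length k), read off the Figure 2 caption
FIGURE_2 = {"a": (117, 16), "b": (36, 8), "c": (13, 4), "d": (54, 9)}

dims = {name: similarity_dimension(g, k) for name, (g, k) in FIGURE_2.items()}
for name in sorted(dims):
    print(name, dims[name])
```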
The bispectrum is the natural third-order generalisation of the power spectrum. It
provides  information  about  correlations  between  different  Fourier  components  of  a 
signal  or  image,  and  about  the  statistics  of  Fourier  phase.  A  number  of  numerical 
and  experimental  studies  of  the  bispectra  of  chaotic  systems  have  been  published. 

In  this  paper  we  present  the  first  analytical  calculations  of  the  bispectra  of  chaotic 
dynamical  systems.  First,  for  a  generalisation  of  the  classical  sawtooth  or  Renyi 
map,  we  calculate  the  bispectrum  using  symbolic  dynamics.  Also,  for  intermittent 
systems,  we  calculate  the  bispectrum  using  the  relationship  between  these  systems 
and  renewal  processes.  We  review  the  results  of  these  calculations,  drawing  some 
conclusions  about  the  characteristic  features  of  the  bispectra  of  chaotic  systems, 
and  compare  them  with  the  features  of  some  financial  time  series. 


Key  words:  Chaos,  bispectrum,  time  series,  intermittent,  finance. 


1  Introduction:  power  spectrum  and  phase  in  time-series  analysis 

Given an experimental time series $x_t$, what can we say about the nature of the system that produced it? This is a question commonly encountered by experimental scientists, and also by those who deal with financial time series such as stock prices or currency exchange rates. Often the first attack at the question is to calculate the power spectrum $P(\omega)$. The power spectrum reveals how the variance, or power, in the time series is shared between the different frequencies $\omega$.

Because the power spectrum contains no phase information, it is not possible to reconstruct the original signal $x_t$ from it, and there are some questions that cannot be answered by inspecting only the power spectrum. One of these is the question of whether an apparently random signal originates from chaos in a low-dimensional deterministic dynamical system or from a stochastic process.1,2

The Fourier transform

$$\hat{x}(\omega) = \sum_t x_t e^{i\omega t} \tag{1}$$

contains the same information as the original signal, and $\hat{x}$ can be written in terms of a modulus and a phase:

$$\hat{x}(\omega) = |\hat{x}(\omega)|\, e^{i\phi(\omega)}, \tag{2}$$

where the modulus $|\hat{x}(\omega)|$ determines the power spectrum. To go beyond the power spectrum while still using the concepts of Fourier analysis, one should therefore consider the statistics of the Fourier phase $\phi(\omega)$. In fact, the phase is more important in determining the appearance of an image than the Fourier modulus or power spectrum.3 All information about the location of features in an image or the time of occurrence of features in a signal is contained in the phase.
It is not straightforward to study the statistics of phase in a time series. One advantage of the power spectrum is that it is a well-defined statistical average, being the Fourier transform of the autocorrelation function. However, stationary stochastic processes (including those defined by a chaotic dynamical system with its invariant measure) do not possess a well-defined average phase $\phi(\omega)$ or phase factor $e^{i\phi(\omega)}$ for any frequency $\omega$.

We can understand why the Fourier phase factor $e^{i\phi(\omega)}$ does not have an average value for a stationary process by noting that translating the original time series by a delay $s$ (that is, replacing $x_t$ by $x_{t-s}$) is equivalent to multiplying each Fourier phase factor by $e^{i\omega s}$. This is just a restatement of the 'shift theorem' of Fourier analysis. Since any time-average of a stationary process must be invariant under a time-shift, this explains why the phase factor does not have a well-defined average.

This argument also provides a hint of how we might construct well-defined average quantities that contain phase information. For any two frequencies $\omega_1$ and $\omega_2$, the phase product

$$R(\omega_1, \omega_2) = e^{i\phi(\omega_1)}\, e^{i\phi(\omega_2)}\, e^{-i\phi(\omega_1 + \omega_2)} \tag{3}$$

is invariant under a translation. A non-zero average of this quantity implies that the phases of the Fourier components at frequencies $\omega_1$, $\omega_2$ and $\omega_1 + \omega_2$ are correlated.
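The shift argument can be illustrated on a short periodic sequence. The sketch below assumes a cyclic delay and the discrete frequencies $\omega_k = 2\pi k/N$, so that the shift theorem holds exactly; the sample values and the helper names (`fourier_sum`, `phase_factor`, `phase_product`, `delay`) are illustrative only.

```python
import cmath
import math

def fourier_sum(x, k):
    """Sum_t x_t exp(i*omega*t) at omega = 2*pi*k/N, following the sign convention of equation (1)."""
    n = len(x)
    return sum(v * cmath.exp(2j * math.pi * k * t / n) for t, v in enumerate(x))

def phase_factor(x, k):
    """exp(i*phi(omega_k)): the Fourier component divided by its modulus."""
    z = fourier_sum(x, k)
    return z / abs(z)

def phase_product(x, k1, k2):
    """Phase factors at omega_1 and omega_2 times the conjugate factor at omega_1 + omega_2."""
    n = len(x)
    return (phase_factor(x, k1) * phase_factor(x, k2)
            * phase_factor(x, (k1 + k2) % n).conjugate())

def delay(x, s):
    """Cyclic delay by s samples: y_t = x_{t-s}."""
    n = len(x)
    return [x[(t - s) % n] for t in range(n)]

x = [0.3, 1.2, -0.7, 0.5, 2.0, -1.1, 0.4, 0.9]
y = delay(x, 3)
print(phase_factor(x, 1), phase_factor(y, 1))          # differ by exp(i*omega*s)
print(phase_product(x, 1, 2), phase_product(y, 1, 2))  # agree up to rounding
```

Each individual phase factor rotates under the delay, while the three-frequency product is unchanged.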

In fact, the mean value of $R$ is the phase of the bispectrum,4,5 evaluated at the frequencies $(\omega_1, \omega_2)$. Just as the power spectrum $P(\omega)$ is the Fourier transform of the autocorrelation function, the bispectrum $P(\omega_1, \omega_2)$ is the double Fourier transform of the bi-correlation function

$$C_x(s,t) = \langle (x_s - \bar{x})(x_t - \bar{x})(x_0 - \bar{x}) \rangle, \tag{4}$$

where the angled brackets $\langle\,\rangle$ denote an expectation value.

The bispectrum can also be estimated directly from the Fourier transform:

$$P(\omega_1, \omega_2) = \lim_{T \to \infty} \frac{1}{T} \left\langle \hat{x}_T(\omega_1)\, \hat{x}_T(\omega_2)\, \hat{x}_T^*(\omega_1 + \omega_2) \right\rangle, \tag{5}$$

where the subscript $T$ implies that the sum in equation (1) is over the first $T$ samples in the time series, and $*$ denotes complex conjugation.
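A minimal sketch of such a direct estimate is given below. It replaces the expectation value by an average over independent segments of fixed length $T$ (an assumption, since the limit $T \to \infty$ cannot be taken numerically) and applies it to a hypothetical three-tone test signal whose third phase is either the sum of the first two or independent of them.

```python
import cmath
import math
import random

def fourier_sum(x, k):
    """Sum_t x_t exp(i*omega*t) at omega = 2*pi*k/T."""
    n = len(x)
    return sum(v * cmath.exp(2j * math.pi * k * t / n) for t, v in enumerate(x))

def bispectrum_estimate(segments, k1, k2):
    """Segment average of x(w1) * x(w2) * conj(x(w1 + w2)) / T."""
    total = 0j
    for seg in segments:
        total += (fourier_sum(seg, k1) * fourier_sum(seg, k2)
                  * fourier_sum(seg, k1 + k2).conjugate()) / len(seg)
    return total / len(segments)

def three_tone_segments(coupled, n_seg=200, t_len=32, k1=3, k2=5, seed=0):
    """Segments containing tones at k1, k2 and k1 + k2 with random phases."""
    rng = random.Random(seed)
    segments = []
    for _ in range(n_seg):
        p1 = rng.uniform(0.0, 2.0 * math.pi)
        p2 = rng.uniform(0.0, 2.0 * math.pi)
        # coupled: third phase is locked to p1 + p2; otherwise it is independent
        p3 = p1 + p2 if coupled else rng.uniform(0.0, 2.0 * math.pi)
        segments.append([math.cos(2 * math.pi * k1 * t / t_len + p1)
                         + math.cos(2 * math.pi * k2 * t / t_len + p2)
                         + math.cos(2 * math.pi * (k1 + k2) * t / t_len + p3)
                         for t in range(t_len)])
    return segments

coupled = bispectrum_estimate(three_tone_segments(True), 3, 5)
uncoupled = bispectrum_estimate(three_tone_segments(False), 3, 5)
print(abs(coupled), abs(uncoupled))
```

With locked phases every segment contributes the same value $T^2/8$, whereas with independent phases the contributions have random directions and largely cancel in the average.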

Higher-order generalisations of the power spectrum, known as polyspectra, can also be defined.4,5 The bispectrum has been applied in time-series analysis to construct tests6,7,8 for nonlinearity and for normality (meaning the presence of a normal distribution). These tests depend upon the observations that for normal systems the bispectrum is zero, and for linear systems (that is, processes constructed by a linear operator acting upon an i.i.d. process) the bicoherence

$$\frac{|P(\omega_1, \omega_2)|^2}{P(\omega_1)\, P(\omega_2)\, P(\omega_1 + \omega_2)} \tag{6}$$

is constant.

The bispectrum has also been used to study phase correlations and the interactions between Fourier components in many experimental systems, including electroencephalographic signals from the human brain,9 and the oscillations of the earth.10 A number of authors8,11,12,13 have studied the bispectra of time series from chaotic dynamical systems, either by numerical simulation or experimental measurement. Subba Rao8 identified 'ridges' in the bicoherence as a characteristic of chaotic systems, and other authors studying electrical circuits have noted various changes in the bispectral characteristics of time series as the systems generating them make the transition to chaos.

Figure 1. Symmetries of the bicorrelation function. For example, if the bicorrelation is given by $f(s,t)$ in the region A, then the bicorrelation in region C is given by $f(t - s, s)$.

To  date,  there  are  no  published  examples  of  chaotic  dynamical  systems  where 
the  bispectrum  can  be  calculated  exactly.  In  this  paper  we  present  an  example  of 
such  a  calculation,  for  a  simple  discrete-time  system  which  is  a  generalisation  of  the 
well-known  saw-tooth  or  Renyi  map.  We  also  show  how  the  relationship  between 
intermittent  chaotic  systems  and  renewal  processes  can  be  used  to  give  asymptotic 
low-frequency  expressions  for  the  bispectra  of  intermittent  systems.  We  identify 
some common features of these two families which may be bispectral characteristics
of chaos. Finally, we compare some financial time series with the chaotic
examples.  Before  presenting  the  calculations,  we  briefly  review  some  properties  of 
the  bicorrelation  function  and  bispectrum. 


2  Symmetries  of  the  bicorrelation  function  and  calculation  of  the 
bispectrum 

This section summarises some properties of the bicorrelation function and bispectrum that are useful in deriving the form of the bispectrum. From its definition in equation (4), one can see that, for a stationary process, the bicorrelation function $C_x(s,t)$ has the symmetries

$$C_x(s,t) = C_x(t,s), \qquad C_x(-s,t) = C_x(s, s+t). \tag{7}$$

By  composing  these  symmetry  operations,  one  can  reach  any  part  of  the  (s,  t)  plane 
from  the  region  0  <  s  <  t  where  the  bicorrelation  is  most  easily  calculated.  These 
relationships  are  summarised  in  figure  1. 

Figure 2. The two types of dynamical system considered in this paper: the generalised sawtooth map (left) with $b = 0.3$, and the intermittent dynamical system described by equation (28) with $\mu = 1$, $z = 2.7$.

The bispectrum $P(\omega_1, \omega_2)$ is defined as the double Fourier transform of the bicorrelation function, the Fourier integral being evaluated over the entire $(s,t)$ plane. Let $P_A(\omega_1, \omega_2)$ be the integral over only region A in figure 1. If the value of the bicorrelation in region A is given by $C_x(s,t) = f(s,t)$, then

$$P_A(\omega_1, \omega_2) = \int_0^\infty ds \int_s^\infty dt\, f(s,t)\, e^{i(s\omega_1 + t\omega_2)}. \tag{8}$$

Similarly, the integral over region B is

$$P_B(\omega_1, \omega_2) = \int_{-\infty}^0 ds \int_0^\infty dt\, C_x(s,t)\, e^{i(s\omega_1 + t\omega_2)}. \tag{9}$$

Using the symmetries in equation (7), and changing the variables of integration, one can show that

$$P_B(\omega_1, \omega_2) = P_A(-(\omega_1 + \omega_2), \omega_2). \tag{10}$$

Similar relationships can be derived between $P_A$ and the Fourier integrals over the other regions in figure 1. The bispectrum can then be calculated by summing all the integrals. The result is

$$P(\omega_1, \omega_2) = \hat{P}_A(\omega_1, \omega_2) + \hat{P}_A(-(\omega_1 + \omega_2), \omega_2) + \hat{P}_A(\omega_1, -(\omega_1 + \omega_2)), \tag{11}$$

where the hat operation $\hat{\ }$ denotes symmetrisation:

$$\hat{f}(x,y) = f(x,y) + f(y,x). \tag{12}$$

3  The  generalised  sawtooth  map 

3.1  The  dynamical  system 

Our first dynamical system is defined by the mapping of the interval (0, 1) onto itself

$$f(x) = \begin{cases} x/b, & x < b \\ (x - b)/(1 - b), & x \ge b \end{cases} \tag{13}$$

as pictured in Figure 2. The dynamical system $x_{t+1} = f(x_t)$ has a uniform invariant distribution. It is chaotic, with Lyapunov exponent

$$\lambda = -b \ln b - (1 - b) \ln(1 - b) \tag{14}$$

easily calculated by standard methods.14

This  dynamical  system  is  a  generalisation  of  the  sawtooth  or  Renyi  map  which 
has  been  studied  by  many  authors.14  When  b  =  1/2,  the  two  systems  are  identical. 
This  generalisation  makes  the  study  of  the  system  more  difficult,  but  it  is  necessary 
in  this  case  because  when  b  =  1/2,  the  system’s  bispectrum  is  zero.  This  is  an 
effect of the symmetry of the function $f$. Sakai and Tokumaru15 have calculated
the  autocorrelation  function  of  the  b  =  1/2  system  using  a  relationship  between 
this  system  and  an  autoregressive  process. 
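As a small illustration, the sketch below codes the two branches of the map (slope $1/b$ below $b$, slope $1/(1-b)$ above it) and compares equation (14) with an average of $\ln|f'(x)|$ over the uniform invariant density. The midpoint grid used for the average and all function names are assumptions made for this example.

```python
import math

def sawtooth(x, b):
    """Generalised sawtooth map on (0, 1): x/b below b, (x - b)/(1 - b) above."""
    return x / b if x < b else (x - b) / (1.0 - b)

def slope(x, b):
    """Derivative of the map on the branch containing x."""
    return 1.0 / b if x < b else 1.0 / (1.0 - b)

def lyapunov_formula(b):
    """Equation (14)."""
    return -b * math.log(b) - (1.0 - b) * math.log(1.0 - b)

def lyapunov_uniform_average(b, n=1000):
    """Average of ln|f'(x)| over a midpoint grid, i.e. over the uniform invariant density."""
    return sum(math.log(slope((i + 0.5) / n, b)) for i in range(n)) / n

print(lyapunov_formula(0.3), lyapunov_uniform_average(0.3))
print(lyapunov_formula(0.5), math.log(2.0))   # ordinary sawtooth map
```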

3.2  Symbolic  dynamics 

The method we use to study this system is symbolic dynamics,14 where a sequence of symbols from a discrete and finite alphabet represents a trajectory of a dynamical system. The representation for the generalised sawtooth map is not difficult to derive. Suppose that $x_0$ and $x_1$ are two successive points in the trajectory of the system, so that $x_1 = f(x_0)$. Then we have either, if $x_0 < b$,

$$x_0 = b x_1 \tag{15}$$

or, if $x_0 > b$,

$$x_0 = b + (1 - b) x_1. \tag{16}$$

Putting $z_1 = 0$ in the first case, and $z_1 = 1$ in the second, we can write

$$x_0 = z_1 b + x_1 e_1, \tag{17}$$

where $e_i = b - 2 z_i b + z_i$. Continuing along the same lines, we obtain

$$x_0 = z_1 b + e_1 (z_2 b + e_2 (z_3 b + e_3 x_3)) = e_1 e_2 \cdots e_n x_n + b \sum_{k=1}^{n} e_1 e_2 \cdots e_{k-1} z_k. \tag{18}$$

fc=i 

It is easy to show that, under the (uniform) invariant measure, the $z_i$ are independent random variables, taking the values 0 or 1 with probabilities $b$ and $(1 - b)$ respectively. We can therefore use this representation to calculate various averages.
For  example,  in  the  next  section  we  calculate  the  autocorrelation  function  and  the 
power  spectrum. 
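The symbolic representation can be checked with exact rational arithmetic. The sketch below rebuilds $x_0$ from a symbol string and an end point $x_n$, then iterates the map forward to confirm that the same symbols and the same end point are recovered. The particular symbol string, the value of $b$ and the function names are arbitrary choices for illustration.

```python
from fractions import Fraction

def sawtooth(x, b):
    """Generalised sawtooth map, evaluated exactly on Fractions."""
    return x / b if x < b else (x - b) / (1 - b)

def reconstruct_x0(z, x_n, b):
    """x_0 from symbols z = [z_1, ..., z_n] and the end point x_n, as in equation (18)."""
    prod_e = Fraction(1)          # running product e_1 e_2 ... e_{k-1}
    total = Fraction(0)
    for zk in z:
        total += b * prod_e * zk
        prod_e *= b - 2 * zk * b + zk   # e_k = b if z_k == 0, else 1 - b
    return prod_e * x_n + total

def symbols_along_orbit(x0, n, b):
    """Iterate the map n times, recording 0 below b and 1 otherwise."""
    z, x = [], x0
    for _ in range(n):
        z.append(1 if x >= b else 0)
        x = sawtooth(x, b)
    return z, x

b = Fraction(3, 10)
z = [1, 0, 0, 1, 1]
x_n = Fraction(2, 7)
x0 = reconstruct_x0(z, x_n, b)
print(x0, symbols_along_orbit(x0, len(z), b))
```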

3.3  The  power  spectrum 
Multiplying equation (18) by $x_n$, we obtain

$$x_0 x_n = e_1 e_2 \cdots e_n x_n^2 + b \sum_{k=1}^{n} e_1 e_2 \cdots e_{k-1} z_k x_n. \tag{19}$$
We can now take the expectation value, using the fact that random variables with different subscripts are independent, and the expectation values $\langle z_i \rangle = (1 - b)$, $\langle z_i^2 \rangle = (1 - b)$, $\langle x_i \rangle = 1/2$, $\langle x_i^2 \rangle = 1/3$ and

$$\langle e_i \rangle = b^2 + (1 - b)^2 = B, \tag{20}$$

$$\langle e_i^2 \rangle = b^3 + (1 - b)^3 = C, \tag{21}$$

(22) 

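After summing a geometric series, we obtain

$$\langle x_0 x_n \rangle = \frac{1}{4} + \frac{B^n}{12}. \tag{23}$$

This equation is valid for $n > 0$. The autocorrelation function is therefore

$$C_x(t) = \langle x_0 x_t \rangle - \langle x_0 \rangle^2 = \frac{B^{|t|}}{12}. \tag{24}$$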

The power spectrum is the Fourier transform of this function (using the convention of equation (1)):

$$P(\omega) = \frac{1 - B^2}{12\,(1 + B^2 - 2B\cos\omega)}. \tag{25}$$
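The two-point average and the power spectrum can be cross-checked numerically. The sketch below computes $\langle x_0 x_n \rangle$ exactly by summing over all symbol strings $z_1, \ldots, z_n$ (using $\langle x_n \rangle = 1/2$ and $\langle x_n^2 \rangle = 1/3$), and compares a truncated Fourier sum of $B^{|t|}/12$ with the closed form obtained by summing the two geometric series. The truncation length and the function names are assumptions of this example.

```python
from fractions import Fraction
from itertools import product
import math

def two_point(b, n):
    """Exact <x_0 x_n> by enumerating the symbol strings z_1..z_n."""
    total = Fraction(0)
    for z in product((0, 1), repeat=n):
        prob, a, c = Fraction(1), Fraction(0), Fraction(1)   # x_k = a + c * x_n
        for zk in reversed(z):
            e = (1 - b) if zk else b
            prob *= e                     # P(z_k) coincides with e_k for this map
            a, c = zk * b + e * a, e * c  # x_{k-1} = z_k * b + e_k * x_k
        total += prob * (a * Fraction(1, 2) + c * Fraction(1, 3))
    return total

def power_spectrum_sum(B, omega, t_max=400):
    """Truncated Fourier sum of the autocorrelation B**|t| / 12."""
    return sum(B ** abs(t) * math.cos(omega * t) for t in range(-t_max, t_max + 1)) / 12

def power_spectrum_closed(B, omega):
    """Closed form of the same sum (two geometric series)."""
    return (1 - B * B) / (12 * (1 + B * B - 2 * B * math.cos(omega)))

b = Fraction(3, 10)
B = b * b + (1 - b) ** 2
print([two_point(b, n) for n in range(4)])
print(power_spectrum_sum(float(B), 0.7), power_spectrum_closed(float(B), 0.7))
```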


3.4  The  bispectrum 

The same methods can be used to calculate the three-point average $\langle x_0 x_s x_t \rangle$ when $0 < s < t$. We can then calculate the bicorrelation function. The details of this calculation occupy far too much space to be recorded here. The result is, again for $0 < s < t$,

$$C_x(s,t) = \frac{1 - 2b}{12} \left( B^t - B^s C^{t-s} \right), \tag{26}$$

where $B$ and $C$ are defined in equations (20) and (21). The bispectrum can now be calculated using the method described in the last section. The result is given by equation (11), with

$$P_A(x,y) = \frac{(2b - 1)(b - 1)\, b\, e^{-ix}}{12\,(1 - B e^{-ix})(1 - C e^{-ix})(1 - B e^{iy})}. \tag{27}$$

Contour diagrams of the bispectrum and bicorrelation function for a typical value of $b$ are shown in figure 3. The 'ridges' found by Subba Rao8 are not evident for this system for any value of $b$ that we have examined. The most noticeable consistent feature is that the energy in the bispectrum is spread over a broad range of frequencies rather than being concentrated into narrow bands. This may be a characteristic of fully-developed chaos.
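The bicorrelation of the sawtooth map can be verified exactly for small delays. The sketch below enumerates all symbol strings, writes each centred variable $x_k - 1/2$ as an affine function of $y = x_t - 1/2$, and averages the cubic polynomial in $y$ using the moments of the uniform distribution. It is compared with the form $B^t - B^s C^{t-s}$ under the normalisation $(1 - 2b)/12$, which is assumed here; the function names are hypothetical.

```python
from fractions import Fraction
from itertools import product

# E[y^k], k = 0..3, for y = x_t - 1/2 uniform on (-1/2, 1/2)
MOMENTS = (Fraction(1), Fraction(0), Fraction(1, 12), Fraction(0))

def bicorrelation(b, s, t):
    """Exact <(x_0 - 1/2)(x_s - 1/2)(x_t - 1/2)> for 0 <= s <= t."""
    total = Fraction(0)
    for z in product((0, 1), repeat=t):
        prob = Fraction(1)
        p, c = Fraction(0), Fraction(1)      # x_k - 1/2 = p + c*y, starting at k = t
        affine = {t: (p, c)}
        for k in range(t, 0, -1):
            zk = z[k - 1]
            e = (1 - b) if zk else b
            prob *= e
            # x_{k-1} = z_k*b + e_k*x_k, rewritten for the centred variables
            p, c = e * p + zk * b + (e - 1) / 2, e * c
            affine[k - 1] = (p, c)
        poly = [Fraction(1)]                 # coefficients of powers of y
        for pk, ck in (affine[0], affine[s], affine[t]):
            shifted = [Fraction(0)] + [a * ck for a in poly]
            scaled = [a * pk for a in poly] + [Fraction(0)]
            poly = [u + v for u, v in zip(scaled, shifted)]
        total += prob * sum(a * m for a, m in zip(poly, MOMENTS))
    return total

def bicorrelation_formula(b, s, t):
    """(1 - 2b)/12 * (B^t - B^s C^(t-s)), with B and C as in equations (20) and (21)."""
    B = b ** 2 + (1 - b) ** 2
    C = b ** 3 + (1 - b) ** 3
    return (1 - 2 * b) / 12 * (B ** t - B ** s * C ** (t - s))

b = Fraction(3, 10)
print(bicorrelation(b, 1, 3), bicorrelation_formula(b, 1, 3))
```

For $b = 1/2$ the same routine returns zero, in line with the vanishing bispectrum of the symmetric map.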
Figure 3. The bispectrum for the sawtooth map with $b = 0.27$. The left graph shows a surface map of the modulus of the bispectrum, evaluated over the region where $\omega_1$ and $\omega_2$ are positive. The right graph shows a direction field displaying the complex phase of the bispectrum overlaid with a contour map, evaluated over both positive and negative frequencies.


4  Intermittent  dynamical  systems 

In this section, we study the bispectra of a class of intermittent dynamical systems. For definiteness, we take as our model the system defined by the mapping

$$f(x) = (x + \mu x^z) \bmod 1, \tag{28}$$

shown in figure 2 for $\mu = 1$, $z = 1.7$. The analysis given here depends only on the form of $f(x)$ close to the intermittent point at $x = 0$, and so applies to a broad class of intermittent dynamical systems,14 as described by Ben-Mizrachi et al.16 The defining characteristic of these intermittent systems is that they show brief periods of random behaviour between long periods when the system is 'stagnant', remaining close to the centre of intermittency (in the case of the model system above, close to the point $x = 0$).

We will be studying the long-time behaviour, or equivalently the low-frequency behaviour, of these dynamical systems. For this purpose, following Ben-Mizrachi et al.,16 we replace the dynamical system's time series $x_t$ by a series of delta-functions located at the times of escape from periods of stagnation. The dynamical system is thus replaced by a renewal process with waiting-time distribution defined by the distribution of waiting times between escape events. As Ben-Mizrachi et al.16 show, this distribution can be approximated for large $\tau$ by

$$p_\tau(\tau) = \begin{cases} (\alpha - 1)\,\tau^{-\alpha}, & \tau > 1 \\ 0, & \tau < 1, \end{cases} \tag{29}$$

where $\alpha = z/(z - 1)$. Ben-Mizrachi et al.16 also show that the function $c(t)$ giving the probability of an escape event at time $t$ given the occurrence of such an event at time 0 is given by

$$\bar{c}(s) = \frac{1}{1 - \bar{p}_\tau(s)}, \tag{30}$$


where the bar denotes the Laplace transform. The Laplace transform of $p_\tau$ is an incomplete Gamma function,17 whose small-$s$ behaviour is given by

$$1 - \bar{p}_\tau(s) \sim s^{\alpha - 1}\, \Gamma(2 - \alpha) \tag{31}$$

for $1 < \alpha < 2$. For the sake of brevity, we consider only this case (corresponding to $z > 2$) in this paper. Other cases will be covered in a longer paper currently in preparation.
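The small-$s$ behaviour can be checked numerically. The sketch below evaluates $1 - \bar{p}_\tau(s)$ for the power-law waiting-time density with lower cut-off 1, using the substitution $\tau = u^{-1/(\alpha-1)}$ with $u$ uniform on $(0,1)$ and a midpoint rule in $u$, and compares it with $s^{\alpha-1}\Gamma(2-\alpha)$. The quadrature scheme, the grid size and the function names are choices made for this illustration.

```python
import math

def alpha_from_z(z):
    """alpha = z / (z - 1)."""
    return z / (z - 1.0)

def one_minus_laplace(s, alpha, n=200000):
    """1 - E[exp(-s*tau)] for the density (alpha - 1) * tau**(-alpha), tau > 1.

    Uses tau = u**(-1/(alpha - 1)) with u uniform on (0, 1) and a midpoint rule in u.
    """
    power = -1.0 / (alpha - 1.0)
    total = 0.0
    for i in range(n):
        u = (i + 0.5) / n
        total += 1.0 - math.exp(-s * u ** power)
    return total / n

def small_s_asymptote(s, alpha):
    """Leading small-s term s**(alpha - 1) * Gamma(2 - alpha)."""
    return s ** (alpha - 1.0) * math.gamma(2.0 - alpha)

ratio = one_minus_laplace(1e-4, 1.5) / small_s_asymptote(1e-4, 1.5)
print(alpha_from_z(2.1), ratio)
```

The ratio approaches 1 as $s$ decreases; the residual difference comes from a correction of order $s$.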

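For the case $1 < \alpha < 2$, for small $s$ we have $\bar{c}(s) \sim s^{1-\alpha} / \Gamma(2 - \alpha)$, and the low-frequency behaviour of the power spectrum16 is given by

$$P(\omega) = |\bar{c}(i\omega) + \bar{c}(-i\omega)|^2 \sim \frac{\text{constant}}{\omega^{2\alpha - 2}}. \tag{32}$$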

The bispectrum can also be calculated from $c(s)$. First, note that for large $t_1$ and $(t_2 - t_1)$ and for $0 < t_1 < t_2$,

$$\langle x_{t_1} x_0 \rangle \sim c(\infty)\, c(t_1) \tag{33}$$

and

$$\langle x_{t_1} x_{t_2} x_0 \rangle \sim c(\infty)\, c(t_1)\, c(t_2 - t_1). \tag{34}$$

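The bicorrelation $C_x(t_1, t_2)$ is therefore given by

$$C_x(t_1, t_2) = c(\infty)\, c(t_1)\, c(t_2 - t_1) - c(\infty)^2 \big( c(t_1) + c(t_2) + c(t_2 - t_1) \big) + 2 c(\infty)^3. \tag{35}$$

After applying the double Laplace transform and taking the limit $s \to 0$, we find that the first term dominates and

$$\bar{C}_x(s_1, s_2) \sim \frac{c(\infty)}{\Gamma^2(2 - \alpha)\, s_1^{\alpha - 1} (s_2 - s_1)^{\alpha - 1}}. \tag{36}$$

By setting $s_1 = i\omega_1$, $s_2 = i\omega_2$, we obtain an expression for the bispectrum integral $P_A$:

$$P_A(\omega_1, \omega_2) \sim \frac{c(\infty)}{\Gamma^2(2 - \alpha)\, \big( \omega_1 (\omega_2 - \omega_1) \big)^{\alpha - 1}}. \tag{37}$$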

The  bispectrum  is  then  given  by  equation  (11). 

Bispectra calculated numerically from time series generated directly by the dynamical
system (28) are in good agreement with this result, although a large number
of  data  points  is  required  for  the  average  to  converge  to  a  smooth  function.  Figure 
4  shows  the  result  of  one  comparison. 

The structure of these low-frequency bispectra is simple. They show power-law singularities at the lines $\omega_1 = 0$, $\omega_2 = 0$ and $\omega_1 + \omega_2 = 0$. Like the bispectra for the sawtooth map, the bispectra for intermittent systems (at least at low frequencies) show no isolated peaks. Intensity extends across a wide range of frequencies modulated by a power-law envelope.




Figure 4. The theoretical treatment given in this paper predicts that the function $P(\omega_1, 0)$ should diverge as $\omega_1^{2(1-\alpha)}$ as $\omega_1 \to 0$. This graph shows the theory (solid line) and values of a numerically calculated bispectrum using 10000 data points of a time series generated by the dynamical system of equation (28) for $z = 2.1$ ($\alpha = 1.909$).


These  results  suggest  that  bispectral  measurements  may  be  useful  in  cases  where 
it  is  difficult  to  distinguish  power-law  noise  from  a  dynamical  system16,2  from  noise 
from  a  truly  random  source,  such  as  filtered  white  noise.  White  noise  retains  its 
random  phases  when  passed  through  a  linear  filter,  and  so  has  zero  bispectrum. 

5  Bispectra  of  financial  time  series 

The  bispectra  of  economic  time  series  were  first  computed  by  Godfrey,18  who  was 
able  to  reject  the  null  hypothesis  of  linearity  for  a  number  of  stock  price  time  series, 
but  did  not  discuss  other  aspects  of  the  bispectral  form.  For  comparison  with  the 
other  examples  presented  here,  we  calculated  the  bispectrum  of  the  log-increments 
$y_n = \log(x_n / x_{n-1})$ of the Dow Jones Industrial Average stock market index, evaluated
daily over the period January 1994 to February 1999. The bispectrum was
estimated  from  the  bicorrelation  using  a  Parzen  window,  as  in  the  work  of  Subba 
Rao  and  Gabr7.  The  result  (figure  5)  shows  broad  spectral  intensity,  as  for  the  two 
chaotic  examples  presented  here,  but  concentrated  more  at  higher  frequencies.  This 
shift  towards  higher  frequencies  is  probably  a  result  of  taking  the  log-increments, 
which  is  effectively  a  differentiation  process. 

6  Conclusion 

One recognised characteristic of chaotic dynamical systems is that they show a continuous power spectrum, with intensity distributed across a wide range of frequencies. The results of this paper show that, at least in some cases, this is also true of their bispectra. This provides a way of distinguishing a chaotic time series from filtered white noise. However, other classes of processes share this characteristic of broad-band bispectral intensity. This is clear because the bispectra for the intermittent systems in this paper were calculated by exploiting the similarity between these systems and renewal processes. A broad-band bispectrum cannot therefore be taken as an unambiguous sign of chaos. A broad-band bispectrum does indicate interactions between Fourier modes over a wide range of frequencies, and is therefore a sign that the system studied does not have a simple description in the Fourier domain. The broad-band character of the bispectra of financial time series should therefore be taken as an indication of the complexity of the underlying processes.

Figure 5. The left-hand graph shows the modulus of the bispectrum for the log-increments of the Dow Jones industrial average stock index (values shown on the right).

The  authors  would  like  to  thank  Abby  Evans,  Martin  Turner  and 
Ursula  Augsdorfer  for  help  with  the  production  of  this  paper. 
A simple evolutionary model is introduced for neural development along the lines of the Bak-Sneppen model for biological evolution of an ecology. The model represents a set of neurons and their connections together with associated synaptic weights. Evolution of the system is studied for different model fitness functions of the synaptic weights. The model systems exhibit Darwinian evolution of the synaptic weight space towards maturation.


1  Introduction 

Until recently the biological contribution to learning was thought to be an unfolding according to an intrinsic schedule.1,2 However, evidence now indicates that the developing cerebral cortex is largely free of domain-specific structure3,4 and the representational properties of the cortex are built by the nature of the problems confronting it.

At the neurobiological level, learning stems from the interaction between intrinsic growth and environmentally derived activity.4 Two factors that are fundamental for this interaction are selection and variance. Selection5,6,7,8 has two distinct stages. The first stage constructs "pre-representations". The second stage selectively eliminates certain of these representations. The most fit representations survive to underlie mature skills.

Variation is important for the development of a maximally flexible representation capacity. The florid growth of neural tissues in ontogeny4 (synapses, and axonal and dendritic arborisation) represents one possibility for introducing a significant chance element to learning. Representations are consolidated by a gradual increase in the synaptic weights of preferred units (synaptic weight space). The widely accepted mechanism for such consolidation is Hebbian learning via positive reinforcement. At the neural net level, Hebbian learning follows from the covariance of pre- and post-synaptic discharges. At the circuit level, it involves synergy of oscillators involved in common function. Synapses which do not participate in the maintenance of circuits are eliminated.

Selectivity and variability are also important aspects of biological evolution. One of the simplest models for biological evolution is the Bak-Sneppen model.9 In this model evolutionary activity is simulated through random mutations of the least fit species and its neighbours. A characteristic feature of this model is that it exhibits Self-Organized Criticality whereby the system evolves through a succession of punctuated equilibria to a state where almost all species have fitness above a threshold level.

In this paper we have introduced new variants of the Bak-Sneppen model to investigate the evolution of synaptic weight space towards maturation in simple models for neural development.

2  Neural  Connectivity  Model 

Consider a set of $n$ units (neurons) labelled $1, 2, 3, \ldots, n$ on a periodic one-dimensional lattice, i.e., units $k + 1$ and $k - 1$ are neighbours of unit $k$, which is equivalent to unit $n + k$. Associated with each unit $k$ we identify a set of $c(k)$ connections (synapses). Two versions of the models are studied: one in which the number of connections is kept fixed ($c(k) = m$) and the other in which the number of connections (for the least fit unit and its two neighbours) is selected at random ($c(k) \in [1, m]$) at each update step. We have in mind that the former situation will be easier to investigate theoretically whereas the latter will be more representative of a model neural system. Let $C_j(k)$ denote the $j$'th connection associated with unit $k$ and identify a corresponding synaptic weight $a \le w_j(k) \le b$. Pre-representations are introduced by choosing the synaptic weight for each connection at random from a uniform distribution in the range $[a, b]$. Initially we suppose that the system is fully connected, i.e., initially $c(k) = m$ for all $k$. There is no discernible difference in the long term results whether or not the system is initially randomly connected. Selection and variation in the system are modelled by replacing the least fit unit and its two neighbours by new units with new randomly assigned connections and weights.

Three  different  models  have  been  examined.  In  Model  A  the  weights  are  random 
numbers  in  the  range  [0, 1]  and  the  fitness  of  a  unit  is  defined  as  the  minimum  weight 
for that unit. In Model B the weights are random numbers in the range [-1, 1] and
the fitness of a unit is defined as the average of the weights for that unit. In Model
C the weights are again random numbers in the range [-1, 1] but the fitness is
defined as the sum of the weights. The choice of the range [-1, 1] in the latter two
models is to include the possibility of both excitatory and inhibitory synapses. In
each  model  the  steps  in  the  update  procedure  are  as  follows: 

1. Set up the pre-representations which are fully connected with random synaptic
weights.

2.  Calculate  the  fitness  for  each  unit  and  identify  the  unit  with  the  lowest  fitness. 

3.  For  the  least  fit  unit  and  each  of  its  neighbouring  units  reset  the  number  of 
connections  depending  on  the  variant  of  the  model  and  assign  new  weights  at 
random  from  a  uniform  distribution  on  [a,  b]. 

4.  Return  to  step  2  and  continue  for  N  updates. 
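The update procedure above can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' code: the function name `simulate`, its parameters, the periodic (ring) neighbourhood, and the uniform choice of the number of connections from {1, ..., m} in the random variant are all assumptions made for the example.

```python
import numpy as np

def simulate(n=100, m=2, n_updates=10_000, model="A",
             random_connections=False, seed=0):
    """Return the fitness of the least fit unit recorded at each update."""
    rng = np.random.default_rng(seed)
    # Weight range [a, b]: [0, 1] for Model A, [-1, 1] for Models B and C.
    a, b = (0.0, 1.0) if model == "A" else (-1.0, 1.0)

    def new_unit():
        # Assumed rule: c(k) = m (fixed) or uniform on {1, ..., m} (random).
        c = int(rng.integers(1, m + 1)) if random_connections else m
        return rng.uniform(a, b, size=c)

    def fitness(w):
        if model == "A":
            return w.min()    # minimum weight
        if model == "B":
            return w.mean()   # average weight
        return w.sum()        # Model C: sum of weights

    # Step 1: fully connected pre-representations with random weights.
    weights = [rng.uniform(a, b, size=m) for _ in range(n)]
    fit = np.array([fitness(w) for w in weights])

    min_fitness = np.empty(n_updates)
    for s in range(n_updates):
        # Step 2: identify the least fit unit.
        k = int(np.argmin(fit))
        min_fitness[s] = fit[k]
        # Step 3: replace it and its two neighbours (ring geometry assumed).
        for j in (k - 1, k, k + 1):
            j %= n
            weights[j] = new_unit()
            fit[j] = fitness(weights[j])
    # Step 4 is the loop itself: repeat for n_updates updates.
    return min_fitness
```

Calling `simulate(model="A", m=1)` corresponds to the Bak-Sneppen case; the returned array is the sequence of minimum fitness values from which a staircase plot can be built.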

Model A with m = 1 is the Bak-Sneppen model. The transient behaviour of
this model is characterized by punctuated equilibria as the system evolves towards
a statistically stationary state in which the density of weights in the system with
w < w* vanishes. The critical weight w* ≈ 0.66702 ± 0.00003 [10] is referred to as the
self-organized threshold. A plot of the largest value of the minimum weight after
s updates versus s reveals a staircase structure in which the lengths of the steps
follow a power-law distribution. The change in the minimum weight across a level
step in the staircase is referred to as an avalanche [9].

Figure 1. Snapshot of the stationary state for Model A with m = 2 and fixed numbers of
connections.

3  Simulations 

Simulations of the models have been carried out for n = 100 units and s = $10^6$
updates over a range of values of the maximum number of connections m ∈ [1, $2^{10}$],
both for fixed numbers of connections and for random numbers of connections (up to
the maximum m). To facilitate the discussion of these results let $f_m(s)$ denote the
fitness of the least fit unit after the sth update in a model where m is the maximum
number of connections.

In all cases the transient behaviour exhibits punctuated equilibria and the long
term behaviour is highly correlated with almost all fitness values above a critical
value. Figure 1 shows the weights for each unit plotted against the unit number in
the 'stationary state', after $10^4$ updates, in Model A with m = 2 and the number
of connections fixed. Except for the localized avalanche, all weights are above
the limiting threshold weight. Similar long term trends are found in all cases
independent of m (compare, for example, Figure 1 of [11], which shows a similar
plot for the original Bak-Sneppen model); however, the magnitude of the limiting
threshold fitness is m dependent.

The transient behaviours with their characteristic punctuated equilibria are
shown for each of the models in Figures 2a, 2b, 3a, 3b, 3c. These figures show plots of
the maximum value of the minimum fitness after s updates,

$$\lambda(m, s) = \max_{[0,s]} f_m(s),$$

versus s. In each horizontal step of the staircase structure in these figures the
minimum fitness remains less than this threshold value. A rise in the staircase
occurs when the minimum fitness exceeds this threshold value. The case of Model
B with fixed numbers of connections was not simulated because this is the same
as Model C with fixed numbers of connections apart from the scale factor m (see
further comments below).

Figure 2. Devil's staircase structure for Model A; (a) fixed numbers of connections, (b) random
numbers of connections.
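As an illustration of how the staircase is obtained from a recorded sequence of minimum fitness values, the sketch below computes the running maximum and the lengths of the horizontal steps. The function names are hypothetical and the input is assumed to be a one-dimensional array of the minimum fitness at each update.

```python
import numpy as np

def staircase(min_fitness):
    """Running maximum of the minimum fitness, max over [0, s] of f_m(s)."""
    return np.maximum.accumulate(np.asarray(min_fitness, dtype=float))

def step_lengths(min_fitness):
    """Lengths, in updates, of the horizontal steps of the staircase."""
    lam = staircase(min_fitness)
    # A rise occurs wherever the running maximum increases.
    rises = np.flatnonzero(np.diff(lam) > 0) + 1
    edges = np.concatenate(([0], rises, [len(lam)]))
    return np.diff(edges)
```

For the short sequence 0.1, 0.05, 0.3, 0.2, 0.25, 0.4 the staircase is 0.1, 0.1, 0.3, 0.3, 0.3, 0.4 and the step lengths are 2, 3 and 1.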

In Models A and B the limiting threshold fitness,

$$\lambda^*(m) = \lim_{s \to \infty} \max_{[0,s]} f_m(s),$$

decreases with an increase in the maximum number of connections m. In Model C
the opposite is true. Figure 4 shows plots of the limiting threshold fitness $\lambda^*(m)$
versus m for each of the models. The error bars in the plots are based on data
from ten runs of each model and the lines are ensemble averages. From these plots
we note that for large m: $\lambda^*(m) \sim 1/m$ for Model A; $\lambda^*(m) \sim 1/\sqrt{m}$ for Model
B; and $\lambda^*(m) \sim \sqrt{m}$ for Model C. For Model A the limiting value $\lambda^*(1)$ is the
Bak-Sneppen self-organized criticality threshold value 0.66702.... For Models B
and C the limiting value $\lambda^*(1) \approx 2 \times 0.66702 - 1 = 0.33404$.

Figure 3. Devil's staircase structure; (a) Model B with random numbers of connections; (b) Model
C with fixed numbers of connections, (c) Model C with random numbers of connections.

Figure 4. Plot of the limiting threshold fitness versus the maximum number of connections m: (a)
Model A with fixed numbers of connections; (b) Model A with random numbers of connections;
(c) Model B with random numbers of connections; (d) Model C with fixed numbers of connections;
(e) Model C with random numbers of connections.

4  Scaling  Analysis 

The functional relationships between the limiting threshold fitness (the
self-organized threshold) and the maximum numbers of connections can be obtained
using simple probabilistic arguments. To this end (following [12]) we first define the
avalanche probability function, $P_{\lambda(m)}(s)$, as the probability for an avalanche with
threshold fitness $\lambda(m)$ which starts at k = 0 to end at k = s. It follows immediately
from the definition that $P_{\lambda(m)}(0) = 0$. In Model A, the probability for the
avalanche to end at step k + 1 is

$$P_{\lambda_A(m)}(k+1) = \left(1 - P_{\lambda_A(m)}(k)\right)\left(1 - \lambda_A(m)\right)^{3m}. \tag{1}$$

The first factor on the right hand side is the probability that the avalanche did
not stop at the earlier step k and the factor $(1 - \lambda_A(m))^{3m}$ is the probability of
independently selecting new fitness values at random from a uniform distribution
for each of the 3m connections, for the least fit unit and its two neighbours, above
the threshold fitness $\lambda_A(m)$. It is a straightforward exercise to obtain the solution
to the difference equation, Eq. (1). The solution for initial condition $P_{\lambda_A(m)}(0) = 0$
is

$$P_{\lambda_A(m)}(k) = \frac{\left(1 - \lambda_A(m)\right)^{3m}}{1 + \left(1 - \lambda_A(m)\right)^{3m}} \left(1 - (-1)^k \left(1 - \lambda_A(m)\right)^{3mk}\right). \tag{2}$$
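A quick numerical check of the difference equation may be helpful here. Writing q = (1 - λ)^(3m), the recurrence P(k+1) = (1 - P(k)) q with P(0) = 0 has the closed form P(k) = q/(1+q) (1 - (-1)^k q^k). The sketch below, with hypothetical function names, iterates the recurrence and evaluates the closed form so the two can be compared.

```python
def avalanche_probability_iterated(lam, m, k):
    """Iterate P(k+1) = (1 - P(k)) * q from P(0) = 0, with q = (1 - lam)**(3*m)."""
    q = (1.0 - lam) ** (3 * m)
    p = 0.0
    for _ in range(k):
        p = (1.0 - p) * q
    return p

def avalanche_probability_closed(lam, m, k):
    """Closed-form solution of the same recurrence."""
    q = (1.0 - lam) ** (3 * m)
    return q / (1.0 + q) * (1.0 - (-1.0) ** k * q ** k)
```

The two functions agree to rounding error for any threshold between 0 and 1, and both approach the fixed point q/(1+q) as k grows.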

At the limiting threshold values $\lambda^*_A(m)$ we anticipate that the avalanche
probabilities $P_{\lambda^*_A(m)}(k)$ are independent of m. In particular by equating
$P_{\lambda^*_A(m)}(k) = P_{\lambda^*_A(1)}(k)$ we obtain the relation

$$\lambda^*_A(m) = 1 - \left(1 - \lambda^*_A(1)\right)^{1/m}, \tag{3}$$

where $\lambda^*_A(1)$ is the limiting threshold fitness for the Bak-Sneppen model. The
agreement between Eq. (3) and the values plotted in Figure 4 is summarized in
Table 1.
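The relation for Model A follows from requiring (1 - λ*_A(m))^(3m) = (1 - λ*_A(1))^3, which gives λ*_A(m) = 1 - (1 - λ*_A(1))^(1/m). A short sketch for evaluating it is given below; the function name is hypothetical, and the default value 0.67789 is the simulated m = 1 entry listed in Table 1.

```python
def lambda_a_theory(m, lambda_a1=0.67789):
    """Limiting threshold for Model A: 1 - (1 - lambda_A(1))**(1/m)."""
    return 1.0 - (1.0 - lambda_a1) ** (1.0 / m)
```

With this starting value the function gives approximately 0.4325 for m = 2 and 0.2466 for m = 4, in line with the theoretical column of Table 1. For large m the expression behaves like -ln(1 - λ*_A(1))/m, which is the 1/m scaling noted for Model A.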

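We now consider the scaling relation for Model C. In this case the avalanche
probabilities are determined by the equation

$$P_{\lambda_C(m)}(k+1) = \left(1 - P_{\lambda_C(m)}(k)\right)\left(1 - \Phi\!\left(\frac{\lambda_C(m)}{\sqrt{m}\,\sigma}\right)\right)^{3}, \tag{4}$$

where

$$\Phi(x) = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{x} \exp\!\left(-\frac{t^2}{2}\right) dt.$$

The second factor on the right hand side of Eq. (4) follows from the Central Limit
Theorem. For sufficiently large m, the sum Y(m) of m random numbers from a
uniform distribution with zero mean and variance $\sigma^2$ is a Gaussian random variable
Z with zero mean and variance $m\sigma^2$. Hence the probability of randomly selecting
the sum $Y(m) > \lambda_C(m)$ is equal to the probability of selecting the Gaussian random
variable $Z > \lambda_C(m)$. This probability is given by

$$1 - \Phi\!\left(\frac{\lambda_C(m)}{\sqrt{m}\,\sigma}\right).$$

The solution of Eq. (4) for initial condition $P_{\lambda_C(m)}(0) = 0$ is

$$P_{\lambda_C(m)}(k) \approx \frac{q_C(m)}{1 + q_C(m)} \left(1 - (-1)^k q_C(m)^k\right), \qquad q_C(m) = \left(1 - \Phi\!\left(\frac{\lambda_C(m)}{\sqrt{m}\,\sigma}\right)\right)^{3}. \tag{5}$$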

The avalanche probabilities given by Eq. (5) are approximations based on the
Central Limit Theorem which holds with increasing accuracy as m increases. The
approximation is already reasonable for m ≥ 2 but it does not hold in the case m = 1
where the sum over m random numbers from a uniform distribution on [-1, 1] is
simply the uniform random variable. In this case the probability for choosing a
random variable greater than $\lambda_C(1)$ is $(1 - \lambda_C(1))/2$ so that the avalanche probability
is given by

$$P_{\lambda_C(1)}(k+1) = \left(1 - P_{\lambda_C(1)}(k)\right)\left(\frac{1 - \lambda_C(1)}{2}\right)^{3}. \tag{6}$$
Table 1. Comparison between limiting threshold fitness values obtained from the numerical
simulations and from the theoretical scaling relations, Eqs. (3) and (7). The percentage
difference between these values is also shown.

            Model A                              Model C
   m        simulation   Eq. (3)    % diff.      simulation   Eq. (7)    % diff.

   1        0.67789      0.67789    0.00         0.35751      -          -
   2        0.43259      0.43245    0.03         0.39351      0.39351    0.00
   4        0.24769      0.24664    0.42         0.55863      0.55650    0.38
   8        0.13330      0.13203    0.95         0.77354      0.78701    1.74
   16       0.06815      0.06835    0.29         1.07521      1.11300    3.51
   32       0.03501      0.03478    0.65         1.52210      1.57402    3.41
   64       0.01774      0.01754    1.13         2.17175      2.22601    2.50
   128      0.00884      0.00881    0.36         3.01230      3.14805    4.51
   256      0.00442      0.00441    0.29         4.29448      4.45201    3.67
   512      0.00222      0.00221    0.53         6.09033      6.29610    3.38
   1024     0.00111      0.00110    0.47         8.49563      8.90402    4.81

The limiting threshold value for m = 1 in Model C is given by $\lambda^*_C(1) = 2\lambda^*_A(1) - 1$.
To obtain the scaling behaviour with m of the limiting threshold values $\lambda^*_C(m)$ for
m ≥ 2 we now equate $P_{\lambda^*_C(m)}(k) = P_{\lambda^*_C(2)}(k)$ which yields

$$\lambda^*_C(m) = \sqrt{\frac{m}{2}}\,\lambda^*_C(2). \tag{7}$$

The good agreement between the $\lambda^*_C(m)$ values plotted in Figure 4 for m ≥ 2 and
Eq. (7) is summarized in Table 1.
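The square-root scaling for Model C, λ*_C(m) = sqrt(m/2) λ*_C(2), can be evaluated directly against the tabulated values. In the sketch below the function names are hypothetical, the default 0.39351 is the simulated m = 2 entry of Table 1, and the percentage difference is taken relative to the simulated value, a convention that appears to reproduce the tabulated percentages but is not stated in the text.

```python
import math

def lambda_c_theory(m, lambda_c2=0.39351):
    """Limiting threshold for Model C: sqrt(m / 2) * lambda_C(2)."""
    return math.sqrt(m / 2.0) * lambda_c2

def percent_difference(simulated, theoretical):
    """Percentage difference relative to the simulated value (assumed convention)."""
    return 100.0 * abs(simulated - theoretical) / simulated
```

For m = 8 the relation gives about 0.7870, and `percent_difference(0.77354, 0.78701)` is about 1.74, matching the corresponding row of Table 1.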

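The scaling behaviour in the case of Model B can be obtained by repeating the
analysis for Model C but replacing $\Phi\!\left(\lambda_C(m)/(\sqrt{m}\,\sigma)\right)$ by $\Phi\!\left(\sqrt{m}\,\lambda_B(m)/\sigma\right)$ since the average
of m random numbers from a uniform distribution with zero mean and variance
$\sigma^2$ is a Gaussian random variable with zero mean and variance $\sigma^2/m$. This yields the
scaling result

$$\lambda^*_B(m) = \sqrt{\frac{2}{m}}\,\lambda^*_B(2). \tag{8}$$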


5  Discussion 

The  simple  evolutionary  models  for  neural  development  introduced  in  this  paper 
attempt  to  model  the  chance  aspect  of  learning  via  Darwinian  evolution.  All  models 
exhibit  Darwinian  evolution  of  the  synaptic  weight  space  towards  maturation  where 
almost  all  neurons  have  fitness  levels  above  a  threshold  value. 

It is anticipated that the maximum number of connections scales with memory
and learning. With this interpretation the 'fitness' functions in each of the models
need some clarification since from Figure 4 and the scaling analysis above we
see that only Model C exhibits an increase of fitness with increasing memory and
learning. On this basis the fitness function in Model C is more representative
of learning and memory than the fitness functions in Models A and B. However, a
fitness function that scales with memory and learning can also be recovered from the
present fitness functions in Models A and B simply by multiplying by an appropriate
monotonically increasing function of m. For example, multiply by $m^\gamma$ where $\gamma > 1$
for Model A, $\gamma > 1/2$ for Model B, and $\gamma > 0$ for Model C.

It is interesting that the scaling relations $\lambda^*_A(m) \sim 1/m$, $\lambda^*_B(m) \sim 1/\sqrt{m}$ and
$\lambda^*_C(m) \sim \sqrt{m}$, which were derived in the theoretical analysis (and obtained
numerically) for the case of fixed numbers of connections, are also obtained in our numerical
simulations when the numbers of connections are selected at random (for the least
fit unit and its neighbours) in each update. With the appropriate interpretation
of fitness as above, for a fixed maximum number of connections, we find that the
neurons self-organize to operate at increasing fitness whilst at the same
time decreasing the numbers of active connections. The scaling analysis of this
reduction in the number of active connections with increasing fitness is deserving
of further study.

Towards the end of this study we became aware of the neuronal model of
self-organized learning recently introduced by Chialvo and Bak [13]. Our models are
similar to their model to the extent that memory and learning are consolidated via
Darwinian elimination of the least fit units rather than via Hebbian reinforcement
of the most fit units. On the other hand our models are cruder than the
Chialvo-Bak model since we do not include the dynamics of firing in our models.

The purpose of this study was to show that fractal correlation and irregularity of heart rate
behavior were altered in intrauterine growth restricted fetuses (IUGR group) and fetuses whose
mothers had pregnancy induced hypertension (PIH group). We analyzed fetal heart rate data
of 5000 points in normal (n=98), IUGR (n=45), and PIH (n=46) fetuses, with gestational ages
> 38 weeks and without any perinatal complication. We calculated approximate entropy to
quantify irregularity, and the short-range (< 80 beats, α1) and long-range (> 80 beats, α2) fractal
scaling exponents to quantify the fractal correlation properties. We also performed spectral
analysis. In the IUGR group, the statistical and spectral measures, α2, and α2/α1 were significantly
higher and the approximate entropy was significantly lower than in the normal group. In the PIH
group, α1 was significantly lower, and α2 and α2/α1 were significantly higher. The fetuses associated
with either IUGR or PIH, although they were not severely compromised, showed abnormal fractal
and/or irregular heart rate behavior.


1.  Introduction 

Antenatal fetal heart rate variability recordings have been widely used to evaluate fetal
well-being.1 However, the conventional indices of heart rate variability such as mean,
variance, short-term and long-term variability have shown limitations2,3 in providing
information about the cardiovascular dynamics of healthy or diseased fetuses. Recently,
newly developed nonlinear dynamical analysis of heart rate variability4-6 has made it possible
to uncover abnormal heart rate behaviors that are not apparent from
conventional measures. This technique is actively used in early detection of
cardiovascular morbidity and prediction of mortality.7-9 However, its application to finding
abnormal behavior of heart rates in sick fetuses has been limited.

Of the new techniques, fractal correlation and irregularity measures of heart rate
behavior have aroused much interest.8,10,11 The fractal measure quantifies the temporal
correlation of each heart rate value with the previous one on short- and long-range scales,
and thereby the smoothness or roughness of the landscape in the heart rate time series on
each scale.12

The aim of the present study was to test whether uncomplicated fetuses associated with
intrauterine growth restriction (IUGR) and pregnancy induced hypertension (PIH), which
are the most frequently encountered fetal sicknesses, had abnormalities in fractal and
irregular heart rate behavior.


2.  Methods 

2.1. Subjects

Among the pregnant women who had visited the outpatient obstetric clinic of Hanyang
University Hospital, 98 uncomplicated cases and 45 cases in which the fetuses were
suspected of IUGR, later confirmed after delivery, were included in this study. We also
included the 46 fetuses whose mothers had severe PIH. Severe PIH was defined when one
or more of the following criteria were
satisfied: blood pressure > 160/110 mmHg, proteinuria > 2+ on reagent strip, oliguria <
400 mL/24 hours, elevated liver enzymes, platelets < 100,000/mm³, and pulmonary edema.
All were single pregnancies, with gestational ages between 39 and 42 weeks which were
calculated using the date of the last menstrual period and early ultrasound measures. All
births were without any major malformations, chromosomal anomalies, or perinatal
complications. Using the growth standards of Brenner et al.,13 birth weights of infants were
determined against gestational ages. Infants with birth weights below the 10th percentile
were regarded as IUGR. At the time of recording, pregnant women in labor or taking
drugs, including smoking, which can affect the fetal heart rate variability, were excluded.

2.2.  Data  collection 

All subjects were in a semirecumbent position for a minimum of 10 minutes before data
collection. Fetal heart rate tracings were recorded for more than 40 minutes using a
Corometrics 115 (Corometrics, Model 115, USA) external fetal monitor. The recorded
data were sampled into a personal computer with a digital serial interface. Whenever
missing data were found, they were recorded as zero. Recordings with more than 10 percent
signal loss over the entire data were excluded. When off-line fetal heart rate values of zero
(missing data), below 30 beats per minute, or above 200 beats per minute were encountered, they
were removed. We selected 5000 data points, which corresponded to about 30 minutes of
recording during which fetal movements were included.
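The cleaning rules described above can be sketched as follows. This is only an illustration under stated assumptions: the function name and parameters are hypothetical, and taking the first 5000 retained points is an assumption, since the text does not say which segment was selected.

```python
import numpy as np

def clean_fhr(samples, n_points=5000, max_loss=0.10, low=30.0, high=200.0):
    """Apply the exclusion rules to one recording (values in beats per minute).

    Returns the cleaned series of n_points values, or None if the recording
    is excluded.
    """
    x = np.asarray(samples, dtype=float)
    # Missing data are stored as zero; exclude recordings with > 10% loss.
    if np.mean(x == 0) > max_loss:
        return None
    # Remove zeros and values outside the 30 to 200 bpm range.
    cleaned = x[(x != 0) & (x >= low) & (x <= high)]
    if cleaned.size < n_points:
        return None
    # Assumption: keep the first n_points retained values.
    return cleaned[:n_points]
```

A recording that passes these checks yields a 5000-point array ready for the spectral, fractal and entropy analyses described in the following subsections.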

2.3. Power spectral analysis

Power spectral analysis was performed by classical fast Fourier transformation.14 We
calculated low- and high-frequency power by integrating the power spectral density
curve between 0.03 and 0.15 Hz, and between 0.15 and 0.4 Hz, respectively.11
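A minimal sketch of band-power integration from an FFT periodogram is shown below. It assumes the series has been resampled onto a uniform time grid with sampling rate `fs` in Hz, which the text does not specify; the function name and the one-sided periodogram normalisation are likewise choices made for the example.

```python
import numpy as np

def band_powers(x, fs, bands=((0.03, 0.15), (0.15, 0.4))):
    """Integrate a one-sided periodogram over each frequency band (in Hz)."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()                       # remove the mean before the FFT
    n = x.size
    spectrum = np.fft.rfft(x)
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    psd = np.abs(spectrum) ** 2 / (fs * n)
    # Double every bin except DC (and Nyquist when n is even) for a one-sided PSD.
    if n % 2 == 0:
        psd[1:-1] *= 2.0
    else:
        psd[1:] *= 2.0
    df = fs / n                            # frequency resolution
    return [psd[(freqs >= lo) & (freqs < hi)].sum() * df for lo, hi in bands]
```

With the default bands the function returns the low-frequency and high-frequency power in that order. A pure sinusoid of amplitude A inside a band contributes A²/2 to that band, which is a convenient sanity check.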

2.4.  Detrended fluctuation  analysis 

To quantify the fractal correlation behavior, we employed the detrended fluctuation
analysis,4,8,15 which is a modification of classical root-mean-square analysis of a random
walk. The heart rate time series (length N = 5000) was first integrated,

$$y(k) = \sum_{i=1}^{k} \left[x(i) - x_{ave}\right],$$

where x(i) is the ith heart rate and $x_{ave}$ is the average heart rate. Next the integrated time
series was divided into boxes of equal length, n. In each box of length n, a least squares
line was fit to the data. The y-coordinate of the straight-line segments is denoted by $y_n(k)$.
Next, we detrended the integrated time series, y(k), by subtracting the local trend, $y_n(k)$,
in each box. The root-mean-square fluctuation of this integrated and detrended time series
is calculated by

$$F(n) = \sqrt{\frac{1}{N} \sum_{k=1}^{N} \left[y(k) - y_n(k)\right]^2}.$$

This computation was repeated over all time scales (box sizes) to provide a
relationship between F(n) and n. Typically F(n) will increase with n. A linear relationship
on a double-log graph indicates the presence of scaling. The slope of the line relating log
F(n) to log n determines the fractal scaling exponent. However, in most of the fetuses'
heart rate time series, we found that the log-log plot was not strictly linear but rather
consisted of two distinct linear regions of different slopes separated at a break point near
80 beats (Fig. 1, crossover point).


[Figure 1: two panels, labelled "Normal fetus" and "Intrauterine growth restricted fetus".]

Figure 1. Plots of log F(n) vs log n (see text) in a normal and a growth restricted fetus aged 40 weeks of
gestation. For both fetuses, detrended fluctuation analysis curves are approximately linear over two regions, with
a slope α1 for small values of n (< 80 heartbeats, short-range fractal scaling exponent) and α2 for large values of
n (> 80 heartbeats, long-range scaling exponent), resulting in apparent crossover phenomena. Note that α1 is similar
in both fetuses (1.253 vs 1.248). However, α2 is significantly higher in the growth restricted fetus (1.092 vs
0.652). Arrows indicate crossover points.

Therefore, fractal correlation of heart rate was defined separately for short-range (<
80 beats, short-range fractal scaling exponent, α1) and long-range (> 80 beats, long-range
fractal scaling exponent, α2) fluctuation of heart rate (Fig. 1). The crossover index was defined
as α2/α1.
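The detrended fluctuation analysis and the two scaling exponents can be sketched as below. This is an illustrative implementation, not the authors' code: the function names, the logarithmically spaced box sizes, the lower limit of 4 beats, and the truncation of the series to a whole number of boxes are all assumptions.

```python
import numpy as np

def dfa(x, box_sizes):
    """Root-mean-square fluctuation F(n) of the integrated, detrended series."""
    x = np.asarray(x, dtype=float)
    y = np.cumsum(x - x.mean())                  # integrated series y(k)
    fluct = []
    for n in box_sizes:
        n_boxes = y.size // n
        # Keep a whole number of boxes; any remainder at the end is dropped.
        segs = y[: n_boxes * n].reshape(n_boxes, n)
        t = np.arange(n)
        # Least-squares line in every box at once (columns of segs.T are boxes).
        slope, intercept = np.polyfit(t, segs.T, 1)
        trend = np.outer(t, slope) + intercept   # local trend y_n(k)
        fluct.append(np.sqrt(np.mean((segs.T - trend) ** 2)))
    return np.array(fluct)

def scaling_exponents(x, crossover=80, n_min=4, n_max=1000):
    """Short-range and long-range exponents and the crossover index."""
    small = np.unique(np.logspace(np.log10(n_min), np.log10(crossover), 10).astype(int))
    large = np.unique(np.logspace(np.log10(crossover), np.log10(n_max), 10).astype(int))
    # Slopes of log F(n) against log n below and above the crossover.
    alpha1 = np.polyfit(np.log10(small), np.log10(dfa(x, small)), 1)[0]
    alpha2 = np.polyfit(np.log10(large), np.log10(dfa(x, large)), 1)[0]
    return alpha1, alpha2, alpha2 / alpha1
```

For uncorrelated white noise both exponents should come out near 0.5, and for a random walk near 1.5, which gives a simple way to check an implementation before applying it to heart rate data.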
2.5. Approximate entropy

The methodological details for computing approximate entropy, ApEn(m,r,N), have
been published elsewhere5,8,11,16,17 and will be described briefly. In order to compute
ApEn(m,r,N), three input parameters should be fixed (m: the length of compared runs, r:
the effective filter, N: the number of data points). Let each heartbeat interval data set be
represented as x(i).

ApEn(m,r,N) = −(average over i of log_e[conditional probability that |x(j+m)−x(i+m)| ≤ r,
given that |x(j+k)−x(i+k)| ≤ r for k = 0, 1, ..., m−1]).  (1)

From x(i), vector sequences u(1) through u(N−m+1) are formed, defined by
u(i) = [x(i), ..., x(i+m−1)]. These vectors represent m consecutive x values, commencing
with the ith point. Define the distance d[u(i), u(j)] between vectors u(i) and u(j) as the
maximum difference in their respective scalar components. Use the sequence u(1),
u(2), ..., u(N−m+1) to construct, for each i ≤ N−m+1, $C_i^m(r)$ = (number of j ≤ N−m+1
such that d[u(i), u(j)] ≤ r)/(N−m+1). Define

$$\Phi^m(r) = \frac{1}{N-m+1} \sum_{i=1}^{N-m+1} \log_e C_i^m(r),$$

and then define the approximate entropy ApEn(m,r,N) = $\Phi^m(r) - \Phi^{m+1}(r)$, so that

$$-\mathrm{ApEn}(m,r,N) = \frac{1}{N-m} \sum_{i=1}^{N-m} \log_e C_i^{m+1}(r) - \frac{1}{N-m+1} \sum_{i=1}^{N-m+1} \log_e C_i^m(r)$$

$$\approx \frac{1}{N-m} \sum_{i=1}^{N-m} \left[\log_e C_i^{m+1}(r) - \log_e C_i^m(r)\right] = \frac{1}{N-m} \sum_{i=1}^{N-m} \log_e\!\left[C_i^{m+1}(r)/C_i^m(r)\right],$$

which equals the average over i of

$$\log_e\!\left[C_i^{m+1}(r)/C_i^m(r)\right].$$

This last parenthetical expression is readily seen to be the conditional probability
indicated in Eq. (1). For this study N = 5000 points, and the two parameters were m = 2
and r = 20% of the standard deviation of each data set. Fig. 2
plots all conditional probabilities over i = 1, 2, ..., 5000,

$$C_i^{m+1}(r)/C_i^m(r),$$

of the heart rate data in a normal and an IUGR fetus. Approximate entropy was
the negative of the average logarithmic conditional probability, i.e.,

$$\mathrm{ApEn}(m,r,N) = -\frac{1}{N-m} \sum_{i=1}^{N-m} \log_e\!\left[C_i^{m+1}(r)/C_i^m(r)\right].$$
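A compact sketch of the approximate entropy calculation is given below, computed as the difference of the two averaged log counts for window lengths m and m+1, with self-matches included. The function name is hypothetical, and using the population standard deviation to set r is an assumption, since the text does not say which estimator was used.

```python
import numpy as np

def approximate_entropy(x, m=2, r_fraction=0.2):
    """ApEn(m, r, N) with r = r_fraction times the standard deviation of x."""
    x = np.asarray(x, dtype=float)
    n = x.size
    r = r_fraction * x.std()   # assumption: population standard deviation

    def phi(mm):
        # Rows are the vectors u(i) = [x(i), ..., x(i+mm-1)], i = 1..N-mm+1.
        u = np.lib.stride_tricks.sliding_window_view(x, mm)
        count = n - mm + 1
        total = 0.0
        for i in range(count):
            # Maximum componentwise distance from u(i) to every u(j).
            d = np.max(np.abs(u - u[i]), axis=1)
            # C_i(r): fraction of vectors within r of u(i), self-match included.
            total += np.log(np.mean(d <= r))
        return total / count

    return phi(m) - phi(m + 1)
```

A regular signal such as a sine wave gives a value close to zero, whereas uncorrelated noise gives a much larger value, which matches the interpretation of approximate entropy as a measure of irregularity. The loop is quadratic in the series length, so a 5000-point series takes noticeably longer than the short series used for checking.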

[Figure 2: two panels, labelled "Normal fetus" and "Intrauterine growth restricted fetus".]

Figure 2. Plots of all conditional probabilities for calculating the approximate entropy of the two fetuses shown
in Fig. 1. The approximate entropy is the negative of the average logarithmic conditional probability. It is
much lower in the growth restricted fetus than in the normal fetus (0.532 versus 1.043).


2.6. Statistical analysis

All data are presented as mean ± 1 standard error of mean. To test the statistical
difference in the involved parameters between the three groups, a general
linear model procedure (Statistical Analysis System 6.11) was used. When a significant statistical
difference in a parameter among the three groups was identified, the difference between
each pair of groups was compared with the Duncan test. All analyses were performed using an
alpha level of 0.05 as the criterion for statistical significance.


3.  Results 


In the IUGR group, the mean birth weight was 2426 g ± 30, compared to 3267 g ± 22 in
the control group and 3134 g ± 22 in the PIH group (p<0.05, p<0.05). The results of statistical,
spectral and nonlinear analysis are summarized in Table 1.

In the growth restricted group, the mean and the variance of heart rate were
significantly higher compared to those of the normal and PIH groups. The frequency domain
measures, low- and high-frequency power, were also significantly higher. However, the
approximate entropy was significantly lower. Detrended fluctuation analysis revealed that
all three groups showed apparent crossovers in the fractal scaling behavior of
heart rate. The emergence of the crossover phenomena was due to a higher short-range
fractal scaling exponent (α1) compared to the long-range fractal scaling exponent (α2) and,
therefore, the crossover index (α2/α1) was far less than unity. The coefficient α1 of the IUGR
group was not significantly different from that of the normal group. However, the values
of α2 and α2/α1 were significantly higher than those of the normal group. In the PIH group,
all indexes except the fractal measures were not significantly different from those of the
normal group; α1 was significantly lower, and α2 and α2/α1 were significantly higher.

Table 1. Statistical, spectral, and nonlinear indexes of heart rate in the normal, IUGR, and PIH groups for entire
gestational periods.

Data are represented as mean ± standard error of mean. IUGR: uncomplicated intrauterine growth restriction;
PIH: pregnancy induced hypertension; bpm: beats per minute; *p<0.05, normal vs IUGR; †p<0.05, normal vs
PIH; ‡p<0.05, IUGR vs PIH.

                                      Normal group     IUGR group         PIH group
                                      (n = 98)         (n = 45)           (n = 46)

Mean (bpm)                            141.7 ± 0.8      145.3 ± 1.4*‡      142.4 ± 1.2
Variance (bpm²)                       47.2 ± 3.5       85.6 ± 9.2*‡       47.8 ± 6.4
Low-frequency power (msec²)           132.8 ± 9.5      204.3 ± 29.5*‡     113.1 ± 1.0
High-frequency power (msec²)          23.8 ± 1.6       33.5 ± 5.0*‡       22.9 ± 2.1
Approximate entropy                   0.687 ± 0.017    0.578 ± 0.023*‡    0.704 ± 0.028
Short-range scaling exponent (α1)     1.451 ± 0.107    1.419 ± 0.122      1.401 ± 0.015†
Long-range scaling exponent (α2)      0.713 ± 0.022    0.970 ± 0.028*     0.915 ± 0.027†
Crossover index (α2/α1)               0.492 ± 0.015    0.689 ± 0.023*     0.660 ± 0.022†


4.  Discussion 


By demonstrating the higher long-range fractal scaling exponent and crossover index
in the IUGR and PIH groups without any perinatal complication, and the lower short-range
fractal scaling exponent in the PIH group, we found that, although these fetuses were not
severely compromised, their fractal heart rate behaviors were abnormal. The
decrease in approximate entropy in the IUGR group also revealed that their
irregular heart rate behavior was altered. In the PIH group, unlike in the IUGR group,
only fractal measures could differentiate them from the normal group. Furthermore, to
differentiate both groups from the normal one, only the long-range fractal scaling exponent
or the crossover index was useful.

The approximate entropy quantifies the irregularity in the time series by measuring
randomness and nonlinearity in it.5,16,17 Therefore, one distinguishing feature of the heart
rate behaviors in the IUGR fetuses is that each heart rate tends to overshoot less or
undershoot less (decreased nonlinearity) and to change less in random directions
(decreased randomness).

The fractal scaling exponent measures the absolute degree of temporal correlation and
smoothness or roughness in heart rate fluctuation.4,8,12

Although the average value of the short-range fractal scaling exponent in the PIH group
was significantly lower, the values of all three groups were near 1.5. Therefore, on the
short-range scale, their heart rate resembles smooth Brownian noise. The average value of
the long-range fractal scaling exponent in the normal group was 0.713, which is near 0.5,
and over the long-range scale the landscape of the heart rate time series resembles rough
white noise, whereas the values in the IUGR and PIH groups were near unity and their
landscape resembles 1/f noise. Therefore, a distinguishing feature of their heart rate
behavior from that of the normal group is that on the long-range scale the heart rate
fluctuation is more temporally correlated and smoother. The increased low-frequency
oscillation evidenced by increased low-frequency power in the IUGR group (Table 1) and
a possible increase in the duration of heart rate acceleration during fetal
movement might be thought to account for the increased long-range correlation and
smoothness. However, this is not the case, since the long-range fractal scaling behavior
operates at 80 - 1000 heartbeats, whose duration is beyond the period of the
low-frequency oscillation and the period of the acceleration.

The abnormal fractal correlation behavior and/or decreased irregularity in the heart
rate of the IUGR and PIH groups may reflect abnormalities in integrated complex
cardiovascular control,8,18 which may impair the fetuses' ability to adapt to external and
internal perturbations and predispose the fetuses to perinatal mortality and morbidity.
This notion is supported by recent observations that either decreased long-range fractal
scaling exponent7,8,10 or approximate entropy11,22 in heart rate dynamics was associated
with patients with a variety of cardiovascular diseases.

In  conclusion,  by  demonstrating  abnormal  fractal  scaling  measures  and/or  decreased 
approximate  entropy  in  fetuses  associated  with  IUGR  and  PIH  and  without  any  perinatal 
complications,  we  found  that,  although  they  are  not  severely  compromised,  their  fractal 
and  irregular  heart  rate  behaviors  were  altered. 

Many natural processes can be characterized by their scale-invariance property.
In this study, we present the results of potential multiple scalings in the long-term
heart rate data from young healthy adults subjected to normal daily activity.
Our approach is based on the direct check of the probabilistic structure of the
increment process. Results from fractional Brownian motion are compared and
the generating mechanism for multiple scaling is discussed in the context of
scale-invariance formalism.

1 Introduction

Many physical processes are scale invariant, giving rise to renormalizable structure
and self-similarity in space and time. Quite often the renormalization
can be conducted at different scales with different scaling exponents, namely, a
multiple scaling property. Multiple scaling has been observed in many natural
phenomena ranging from physical1,2,3,4,5,6 and biological7,8,9,10 to economic systems1,11,12.
When a multiplicative mechanism is involved in the dynamics, multifractal theory
can be used to characterize a (continuous) set of singularity exponents and the
(Hausdorff) dimension of their supports via a Legendre transformation. Successful
applications of the formalism have revealed deep insights about many natural processes
such as the density profile in diffusion limited aggregates13,14, the velocity and
dissipation fields in fully developed turbulence15,2,3, the money exchange index from
financial markets1,11,12 and network traffic5,6. In general, there is no unified
theory for multiple scaling and, sometimes, only a finite number of scaling exponents
can be ascertained. In this work, we will present such an example in the long-term
heart rate variability (HRV) from healthy young adults and discuss its generating
mechanism.

The study of HRV has continued to attract great interest in recent years. It is
of both fundamental and clinical importance. In particular, a “healthy” heart
typically shows a power-law like power spectrum16, which can imply scale invariance(a)
and “self-similarity” in the autonomic nervous system. Loss of such variability
was found to correlate well with the mortality rate of heart disease patients17.
The variation in the HRV scaling has been discussed by a number of researchers
in the past8,10. In particular, Peng et al. developed a systematic method and
compared the power-law scalings in the very short time scale (a few heart beats)
with the “asymptotic” behaviour (above 1000 heart beats). A significant difference
was found between these widely separated time scales, and this characteristic was
found sufficient to distinguish between healthy subjects and patients with
congestive heart failure10. Di Rienzo et al. also noted the variation of the power-law
exponent of HRV in frequency, which in turn causes variations of the power-law
scaling of the blood pressure as well7. Later, Viswanathan et al. tackled the
non-uniformity of power law scaling between healthy subjects and those with severe
heart disease from a point-process aspect18. Again, the conclusion concerned
the distinguishability between health and disease. In these previous studies,
multiple scaling within the physiologically relevant range of healthy humans has not
been explored.

In this work, multiple scaling was studied based on the data increment process.
The increment process at various time lags enables us to focus on the local scale
invariance property of the data. The numerical method was developed and tested
on artificial time series with one and with multiple scaling exponents and then applied
to the heart rate data set. We also compared it with the existing methodology in
the literature and found a noticeable difference in the result. This paper is organized
as follows. The main ideas and the numerical method are introduced in the next
section. The results for artificial time series and the evidence of multiple scaling
in heart rate data are presented in section 3. In the last section, the mechanism
generating the multiple scaling in HRV is discussed in the context of the scale
invariance formalism.

2  Extract  Scale  Invariance  from  the  Probability  Density  Function 

In this section, we will first recall the basic definition of scale invariance of a
process and then describe the numerical method to extract multiple scaling. Given a
time series, r(s), its scale invariance is defined by the family of probability density
functions of the increment, $\Delta r(t; s) = r(t+s) - r(s)$: for any $\lambda$ and a
fixed constant h, one has

$$f_{\lambda t}(n) = \lambda^{-h} f_t(\lambda^{-h} n) \qquad (1)$$

where $f_t$ denotes the probability density function of $n = \Delta r(t; s)$ and the
constant h is the scaling index or scaling exponent of r(s).
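
One consequence of (1) is that the spread of the increments grows as the lag raised to the power h. The following is a minimal sketch of that check on ordinary Brownian motion (h = 0.5), not the method used in this paper; the function name `increment_scaling_exponent`, the lags and the series length are assumptions chosen for illustration.

```python
import numpy as np

def increment_scaling_exponent(r, lags):
    """Estimate h from the growth of the increment spread with the lag."""
    spreads = []
    for lag in lags:
        increments = r[lag:] - r[:-lag]      # Delta r(t; s) = r(t + s) - r(s)
        spreads.append(np.std(increments))
    # If (1) holds, std(Delta r) is proportional to lag**h, so the slope is h.
    slope = np.polyfit(np.log(lags), np.log(spreads), 1)[0]
    return slope

rng = np.random.default_rng(0)
walk = np.cumsum(rng.standard_normal(100_000))   # Brownian motion, h = 0.5
h_est = increment_scaling_exponent(walk, [1, 2, 4, 8, 16, 32])
print(round(h_est, 2))
```

This only tests the second moment of the increment distribution; the approach described in this section works with the full density function instead.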

To study (1) numerically requires the assumption of stationarity. This appears
to hold in our application. When the increment is stationary, $f_t(n)$ can then be
estimated from the ensemble $\{\Delta r(t; s), s = 1, 2, \ldots\}$. Due to the data
fluctuation, it was found more efficient to fit $f_t(n)$ with a specific form, say $g_t(n)$, than


(a) Note that a power-law spectrum is only a necessary condition19,20.
Figure 1. (a) The set $\mathcal{N}(T; h)$ plotted on the h-n plane for $T = \{2^k, k = 0, 1, 2, 3\}$ and
$h_0 = 0.5$; dark pixels: $|\mathcal{N}| = 0$, white pixels: $|\mathcal{N}| > 0$. (b) $\mu(T; h)$ vs. h plot. The family
of density functions used in this demonstration was ideal in that they have the Gaussian form:
$\{g_t(n) = \exp(-n^2/2\sigma(t)^2)/\sqrt{2\pi\sigma(t)^2},\ \sigma(t) = \sqrt{t};\ t = 2^k, k = 0, \ldots, 11\}$.


working directly with the histogram of $\Delta r(t)$ in (1). For example, for
fractional Brownian motion, a Gaussian form is assumed for $g_t(n)$. In this case, the
maximum likelihood method was used, minimizing the negative log-likelihood function
$-\log(\prod_i g_t(n_i))$ in order to extract the mean and variance of a Gaussian
probability density function. This approach is subject to less bias than, say,
minimizing the L2 norm between the histogram and $g_t(n)$21. After $g_t(n)$ is defined,
(1) can be studied by systematically varying the n, $\lambda$ and h values. Let


$$w(n, \lambda; h) = \frac{f_{\lambda t}(n)}{f_t(\lambda^{-h} n)} \qquad (2)$$

and denote the estimated slope, $d\log(w)/d\log(\lambda)$, as $h'$. r is said to be
renormalizable, or scale invariant, with a scaling exponent $h_0$ if $h' \approx h_0$.

The parameters used in the numerical experiment were $t = 1$, $\lambda \in \{2^k, k =
0, \ldots, 11\}$ and $h \in \{0.01 \cdot k, k = 0, \ldots, 100\}$. The estimated slope $h'$ is in general
a function of n, h, and $\lambda$. The range of $\lambda$ in which $h'$ is estimated reveals the
time scale of the local scale invariance property of r(s). Once the parameters are
defined, we first construct, for a given h, $\mathcal{N}(T; h) = \{n : |h'(n, \lambda; h) - h| < \epsilon\}$ with
$\epsilon = 0.01$ to keep the error between $h'$ and h less than 1%. Here, T denotes
the set of $\lambda$'s where the condition $|h' - h| < \epsilon$ is satisfied. For any given h and
T, $\mathcal{N}$ contains the set of n's in which local scale invariance is defined. The scaling
interval of $h'$ can thus be calculated by the logarithm of the ratio of the largest and
smallest $\lambda$ in T. For each h, (1) also contains an isolated solution (when the set
$\mathcal{N}$ has only one element) which has no relevance in the current context (Fig. 1a).
Fortunately, these isolated solutions form a small set. Hence, given any T and h,
$\mu(T; h) = |\mathcal{N}(T; h)|$, where $|\cdot|$ returns the number of elements at a specific h value,
will show a peak at the desired scaling exponent (Fig. 1b).
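
The counting procedure can be sketched on an ideal Gaussian family with $\sigma(t) = t^{h_0}$, in the spirit of Fig. 1. This is an illustrative reading of the method, not the authors' code. Several choices are assumptions made here: the grid of n values, the use of a single least-squares slope over all $\lambda$ rather than a search over subsets T, and taking $h'$ as the negative of the slope of $\log w$ against $\log\lambda$ so that $h' = h$ for an exactly scale-invariant family.

```python
import numpy as np

def gaussian_pdf(n, sigma):
    """Zero-mean Gaussian density with standard deviation sigma."""
    return np.exp(-n**2 / (2.0 * sigma**2)) / np.sqrt(2.0 * np.pi * sigma**2)

def mu_curve(h0=0.5, eps=0.01):
    """Count, for each trial h, the n values whose estimated exponent matches h."""
    lambdas = 2.0 ** np.arange(12)                   # lambda = 2^k, k = 0..11
    h_grid = 0.01 * np.arange(101)                   # h = 0.01 k, k = 0..100
    n_grid = np.linspace(-3.0, 3.0, 121)[:, None]    # assumed grid of increment values
    log_lam = np.log(lambdas)
    mu = np.zeros(h_grid.size, dtype=int)
    for j, h in enumerate(h_grid):
        f_lt = gaussian_pdf(n_grid, lambdas**h0)          # f_{lambda t}(n) with t = 1
        f_t = gaussian_pdf(n_grid * lambdas**(-h), 1.0)   # f_t(lambda^{-h} n)
        log_w = np.log(f_lt) - np.log(f_t)                # shape (n values, lambdas)
        slope = np.polyfit(log_lam, log_w.T, 1)[0]        # one fitted slope per n
        h_est = -slope                                    # sign convention assumed here
        mu[j] = np.count_nonzero(np.abs(h_est - h) < eps)
    return h_grid, mu

h_grid, mu = mu_curve(h0=0.5)
print(h_grid[np.argmax(mu)])
```

For trial values of h away from $h_0$, only a few isolated n values satisfy the tolerance, which is the behaviour the text describes for the isolated solutions.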


Figure 2. (a) Typical RR-interval record (r(s) (sec) vs. s ($\times 10^3$ beat)), (b) the power spectral
density function of r(s) (in log-log scale), (c) histogram of $\Delta r(t)$, t = 4 (in linear-log scale).


3  Numerical  Evidence  of  Multiple  Scaling  in  HRV 

3.1 Experimental Procedure

Six  young  adults  (average  age:  25  yr,  height:  173  cm,  weight:  74  kg)  participated  in 
the  experiment.  The  subjects  were  allowed  to  conduct  their  normal  daily  activity. 
The  difference  in  the  body  surface  potential  was  sampled  at  1000  Hz  for  a  period  of 
approximately 24 hours. The data were then downloaded to a PC and a specialized
computer code was written to search for the QRS complex of each heart beat,
with proper filtering for events such as skipped beats, PVCs, and so on. The time span
between successive contractions of the ventricles, measured as the RR-interval
(RRi), was finally extracted and used in the scale invariance analysis.
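
As a rough illustration of the last step, the RR-interval series can be obtained from detected R-peak positions as follows. The QRS detector and the filtering used in the study are not described, so the function name `rr_intervals` and the plausibility thresholds `rr_min` and `rr_max` are hypothetical stand-ins rather than the authors' procedure.

```python
import numpy as np

def rr_intervals(r_peak_samples, fs=1000.0, rr_min=0.3, rr_max=2.0):
    """RR intervals in seconds from R-peak sample indices.

    Intervals outside [rr_min, rr_max] seconds are dropped as a crude,
    assumed substitute for the skipped-beat and PVC filtering.
    """
    peaks = np.asarray(r_peak_samples, dtype=float)
    rr = np.diff(peaks) / fs
    keep = (rr >= rr_min) & (rr <= rr_max)
    return rr[keep]

# Hypothetical peak positions (in samples at 1000 Hz); the 50 ms gap is rejected.
peaks = [0, 800, 1600, 2500, 2550, 3400]
print(rr_intervals(peaks))
```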

A representative daytime RRi record is shown in Fig. 2a. Scale invariance in the
RRi data may be implied by the power-law trend of the power spectrum (Fig. 2b).
But  it  is  clear  that  a  single  power-law  is  not  sufficient  to  describe  the  spectrum  over 
the  full  frequency  range. 

3.2  Numerical  Results 

Before applying the numerical method to the RRi data, we first tested it on
fractional Brownian motion of scaling exponent $h_0$ = 0.26, 0.5, 0.78. Each artificial
time series had 40,000 samples and was generated by using the spectral method19,20.
Fig. 3a shows a typical case of the set $\mathcal{N}$ plotted on the h-T plane where $T =
2^k, k = 1, 2, \ldots$ Sometimes, it is useful to consider $\sum \mu(T; h)$ over all T sets for a
given h. The result is shown in Fig. 3b where the scaling exponent is indicated at
the location of the peak. It is clear that the desired scaling exponents were captured
with good accuracy.
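
The spectral method is cited above without detail. The sketch below shows one common random-phase variant, given as an assumption rather than as the generator used in the paper: Fourier amplitudes are set to follow a power law with spectral exponent $1 + 2h$ (the relation between spectral slope and scaling exponent used later in the text), phases are drawn uniformly, and the series is obtained by an inverse FFT. The names `spectral_synthesis` and `spectral_slope` are illustrative.

```python
import numpy as np

def spectral_synthesis(n_samples, h, rng):
    """Random-phase series whose power spectrum falls off as f^-(1 + 2h)."""
    freqs = np.fft.rfftfreq(n_samples, d=1.0)
    amplitude = np.zeros_like(freqs)
    amplitude[1:] = freqs[1:] ** (-(1.0 + 2.0 * h) / 2.0)   # zero mean: no DC term
    phases = rng.uniform(0.0, 2.0 * np.pi, size=freqs.size)
    spectrum = amplitude * np.exp(1j * phases)
    spectrum[-1] = spectrum[-1].real    # Nyquist bin must be real for even n_samples
    return np.fft.irfft(spectrum, n=n_samples)

def spectral_slope(x):
    """Least-squares slope of log power vs. log frequency (DC and Nyquist excluded)."""
    power = np.abs(np.fft.rfft(x)) ** 2
    freqs = np.fft.rfftfreq(len(x), d=1.0)
    return np.polyfit(np.log(freqs[1:-1]), np.log(power[1:-1]), 1)[0]

rng = np.random.default_rng(1)
series = spectral_synthesis(4096, 0.78, rng)
print(round(spectral_slope(series), 2))
```

Because the amplitudes are deterministic in this variant, the periodogram follows the power law exactly; generators that also randomise the amplitudes give a noisier spectrum.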

We next applied the spectral method to construct a two-exponent artificial time
series of $2^{16}$ data points (Fig. 4a). The local scale invariance in the time series
was defined by the scaling exponent $h_0 = 0.26$ for frequencies below 0.0025 Hz and
$h_0 = 0.78$ above 0.0025 Hz. Fig. 4d plots $\mu(T; h)$ versus h; the
estimated h's were within 3% of the exact values. In Fig. 4c, we checked the
minimum time scale of the exponents, which is defined by the smallest element of
Figure 3. (a) The set $\mu$ on the h-T plane; dark pixels: $|\mu| = 0$, white pixels: $|\mu| > 0$, for $h_0 = 0.78$.
(b) $\sum \mu(T; h)$ vs. h for $h_0$ = 0.26, 0.5, 0.78.


the T set. This is obtained by constructing the set $\eta(T; h)$ where T is the smallest
$\lambda$ in the corresponding T set. The set $\eta$ shown on the h-$\log_2(T)$ plane reveals the
minimum time scale from which the exponent is defined (Fig. 4c). For the artificial
time series, it was found that the correct time scales of the exponents were captured:
i.e., the “fast” dynamics (above 0.0025 Hz) of $h_0 = 0.78$ prevails at small $\lambda$'s and
the “slow” dynamics of $h_0 = 0.26$ at large $\lambda$ (see Fig. 4c caption).

To study the RRi data, the following form of probability density function was
assumed (based on Fig. 1c):


9t(n)  =  (4) 

7 

The RRi data of all the test subjects exhibit multiple scaling characteristics. In
what follows, we present the evidence from a typical individual whose data has
been given in Fig. 2. In Fig. 5a, multiple peaks are seen in the $\sum \mu(T; h)$ vs. h
plot, indicating multiple scaling of HRV. At least five “significant” exponents were
identified (a to e in Fig. 5a). The minimum time scales of these exponents are scattered
over the range $\lambda = 2^1$ to $2^8$ (Fig. 5b). The inverse of this range covers the
“frequency” range from 0.0039 (1/beat) to 0.5 (1/beat) in the power spectrum (Fig. 2b). In
this range, two linear regions of slopes $-\zeta$, with $\zeta \approx 1.4$ and $2.1$, may be roughly defined.
Based on $\zeta = 1 + 2h$, they match nicely two of the peak locations, h = 0.17 and
0.6 in Fig. 5a. The size of the scaling interval is shown by the set $\mathcal{N}$ plotted on the
h-|T| plane (Fig. 5c). A wide range of scaling intervals associated with the scaling
exponents found in Fig. 5a is again seen.
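
The arithmetic behind this comparison can be written out as a short check, assuming the relation between the magnitude of the spectral slope and the scaling exponent, $\zeta = 1 + 2h$, quoted above; the helper name `exponent_from_slope` is illustrative.

```python
def exponent_from_slope(zeta):
    """Invert zeta = 1 + 2h for the scaling exponent h."""
    return (zeta - 1.0) / 2.0

# Spectral slopes read from the power spectrum, compared with peaks at 0.17 and 0.6.
for zeta in (1.4, 2.1):
    print(zeta, round(exponent_from_slope(zeta), 2))

# Frequencies (1/beat) corresponding to the minimum time scales 2^1 and 2^8.
for k in (1, 8):
    print(k, 1.0 / 2**k)
```

The same relation gives slopes of 1.52 and 2.56 for h = 0.26 and 0.78, the values quoted for the artificial series in Fig. 4.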

Finally, we compared our method with detrended fluctuation analysis
(DFA)8,10 and found a noticeable difference. Fig. 6 shows the double-logarithmic
plot of the average variance versus the scaled window length. Only one exponent
can be ascertained here since the local scale invariance characteristics have been
averaged out in the process of DFA.
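
For readers unfamiliar with DFA, a minimal first-order version is sketched below. It is a generic textbook form, assumed here for illustration; the window sizes, the use of non-overlapping windows and the RMS fluctuation measure are choices made in this sketch and may differ from the variant used for Fig. 6.

```python
import numpy as np

def dfa_exponent(x, window_sizes):
    """Slope of log F(w) vs. log w for first-order detrended fluctuation analysis."""
    profile = np.cumsum(x - np.mean(x))           # integrated, mean-removed series
    fluctuations = []
    for w in window_sizes:
        n_win = len(profile) // w
        segments = profile[: n_win * w].reshape(n_win, w).T   # shape (w, n_win)
        t = np.arange(w)
        coeffs = np.polyfit(t, segments, 1)       # linear trend in each window
        trend = np.outer(t, coeffs[0]) + coeffs[1]
        fluctuations.append(np.sqrt(np.mean((segments - trend) ** 2)))
    return np.polyfit(np.log(window_sizes), np.log(fluctuations), 1)[0]

rng = np.random.default_rng(0)
white_noise = rng.standard_normal(8192)
print(round(dfa_exponent(white_noise, [16, 32, 64, 128, 256]), 2))
```

A single slope fitted over all window sizes summarises the whole series with one number, which is why scale-dependent structure is averaged out.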
Figure 4. (a) Two-time-scale artificial time series, (b) power spectral density of the time series:
the scaling exponents are $h_0 = 0.26$ for f < 0.0025 Hz and $h_0 = 0.78$ for f > 0.0025 Hz. (Note:
$0.0025 = 2^{-8.6}$). The line segments have slopes of -1.52 and -2.56, respectively. (c) The set $\eta$
plotted on the h-$\log_2(T)$ plane; dark pixels: $|\eta| = 0$, white pixels: $|\eta| > 0$. The scaling range for
h = 0.77 was captured for $T < 2^7$ and that for h = 0.27 for $T = 2^7 \sim 2^9$. (d) $\sum \mu(T; h)$ vs. h
plot.


Figure 5. (a) $\sum \mu(T; h)$ vs. h plot. Identified peaks at h = 0.17, 0.24, 0.31, 0.44, 0.6 are labeled as
a, b, c, d, e, respectively; (b) the set $\eta$ plotted on the h-log(T) plane; dark pixels: $|\eta| = 0$, white
pixels: $|\eta| > 0$; (c) the set $\mathcal{N}$ plotted on the h-$\log_2(T)$ plane; dark pixels: $|\mathcal{N}| = 0$, white pixels:
$|\mathcal{N}| > 0$.

4  Discussion  and  Future  Outlook 

Numerical  evidence  of  multiple  scaling  of  HRV  in  physiologically  relevant  range 
has  been  presented  for  the  case  of  healthy  young  adults.  The  scale  invariance  was 
Figure 6. Detrended fluctuation analysis of the RRi data shown in Fig. 2a: averaged variance of
the detrended time series vs. window length on double logarithmic scales (symbol: 'o'). The
fitted line of slope ~ 1.05, suggesting h = 0.05, is shown as the solid line.


extracted based on the property of the family of RRi increment probability density
functions. This approach enables us to explore the local scale invariance property of
the data. At only a slightly higher computational cost, it may yield details previously
unavailable from global methods such as the power spectral density function
or DFA. It should be noted that the current methodology is
not able to extract temporal structure of very short duration. When such a rare
event occurs, its characteristics will yield to those generated by the more “regular”
dynamics in the process of estimating the density function. What this work has
shown is that even the “regular” neuro-control of our cardiovascular system is rich
enough to encompass a wide range of time scales and scaling structures, i.e., multiple
scaling. Indeed, the process is very complex, as Hausdorff and Peng showed from
artificial time series that the “balance” of different inputs to the system is also
crucial to generate the 1/f scaling8.

More detailed characterization of the density function (4) is underway and will
provide insight into the generating mechanism of multiple scaling. For example,
the variation of the parameters $\alpha$, $\beta$ and $\gamma$ in (4) can imply self-similarity of the
individual density function at a specific time lag ($\lambda t$). This in turn can lead to a scale
invariant solution of (1) over a finite n-interval. In particular, given a $\Delta \in R$ and
$n' \in I_n(\lambda t)$, assume the graph of $f_{\lambda t}(n)$ is self-similar in the interval $I_n$:

$$\lambda^{\Delta} f_{\lambda t}(\lambda^{\Delta} n') = f_{\lambda t}(n'). \qquad (5)$$

Since it can be written

$$f_{\lambda t}(n') = \lambda^{-h} f_t(\lambda^{-h} n') \qquad (6)$$

(5) implies

$$\lambda^{\Delta} f_{\lambda t}(\lambda^{\Delta} n') = \lambda^{-h} f_t(n'') \qquad (7)$$

where $n' = \lambda^{h} n''$. Re-arranging terms in (7), one finds

$$\lambda^{h+\Delta} f_{\lambda t}(\lambda^{h+\Delta} n'') = f_t(n'') \qquad (8)$$

which is of the same form as (1). Hence, (8) implies the existence of a new exponent
$h + \Delta$. Substituting the newly found exponent into the scale invariance formalism
(1), even more exponents can be revealed by following the same arguments. In general, we
found that, when certain conditions are met, the number of exponents can “grow” as a
power-law by repeating (5) with all the exponents. However, the scaling interval
$I_n$ also shrinks at the same time by a power-law, making most of the exponents
“unobservable”22. We will report more details in a future publication.
A SEMI-CONTINUOUS BOX COUNTING METHOD FOR FRACTAL
DIMENSION MEASUREMENT OF SHORT SINGLE DIMENSION TEMPORAL
SIGNALS - Preliminary study
The box counting method makes it possible to measure the possible fractal dimension (D) of a single
dimension temporal signal. However, its accuracy varies as a function of the sampling frequency (Fs) and the
duration of the tested signal (Sd). Consequently, as it is impossible to greatly increase Fs, this method
is not suitable for measuring D of short physical signals. Thus, we designed a semi-continuous box
counting method (SCBC) that gives better access to the small scales of the signal, which is especially
useful in the case of short single dimension temporal signals.

Let N = number of samples of the tested signal. SCBC provides the first M points of the log-log
graph through the dyadic division of boxes at large scales until a certain box size $S_M$ is reached, such that
$S_M = 2^M/Fs$. Then, at smaller scales, the box size decreases by 1/Fs for each successive point, which
provides the log-log graph with a large number of points. When the number of boxes, $N/(S_{M+x} \cdot Fs)$, is not a
whole number, the analysed signal is peripherally and symmetrically reduced in abscissa and ordinate, so that a
whole number of boxes is obtained. These truncated samples are then reintroduced when designing the
following boxes. Using SCBC we measured D of mathematical signals whose D is known, and
compared these results to those obtained using the classic dyadic box counting method.


1  Introduction 

A continuous physical signal such as a sound signal, of which speech is an example,
constitutes a time series. If this series is self-similar, measurement of its fractal dimension
(D) allows fluctuations to be characterised by quantifying the complexity and irregularity
of this signal. The classic box counting method seems the most appropriate to quantify
complexity in the temporal succession of events. As the box size is divided by 2 for each
measurement, this method may be termed the dyadic box counting method (DyBC). In the
framework of our research to improve signal processing in auditory prostheses [1,2,3], we
believed that the possible D of speech elements could help patients with implants to
recognize speech without having to lip-read.

In a recent study [10] we applied a fractal approach to vowels. But, unlike
mathematical signals, a physical signal cannot be fractal at every scale. However, let
N(r) be the number of boxes filled by events at the resolution r; by definition the generalised
dimension Dgen of the graph of the signal amplitude vs time is obtained using the limit

$$D_{gen} = \lim_{r \to 0} \frac{\log N(r)}{\log(1/r)}$$
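
In practice the limit is replaced by the slope of $\log N(r)$ against $\log(1/r)$ over the available box sizes. The sketch below is one simple way to count boxes on the graph of a sampled signal, assumed here for illustration and not the authors' implementation: the graph is rescaled to the unit square, and in each time column the boxes between the minimum and maximum amplitude are counted as filled. The names `box_count` and `box_dimension` are illustrative.

```python
import numpy as np

def box_count(signal, box_samples):
    """Number of square boxes (side = box_samples samples) touched by the graph."""
    n = len(signal)
    n_cols = n // box_samples
    span = signal.max() - signal.min()
    # Rescale the amplitude so that the graph fits the unit square.
    y = (signal - signal.min()) / span if span > 0 else np.zeros(n)
    r = box_samples / n                              # box side in unit-square units
    cols = y[: n_cols * box_samples].reshape(n_cols, box_samples)
    top_row = np.ceil(1.0 / r) - 1                   # index of the highest box row
    lo = np.floor(cols.min(axis=1) / r)
    hi = np.minimum(np.floor(cols.max(axis=1) / r), top_row)
    return int(np.sum(hi - lo + 1))

def box_dimension(signal, box_sizes):
    """Slope of log N(r) vs. log(1/r) over the given box sizes (in samples)."""
    n = len(signal)
    log_inv_r = [np.log(n / s) for s in box_sizes]
    log_count = [np.log(box_count(signal, s)) for s in box_sizes]
    return np.polyfit(log_inv_r, log_count, 1)[0]

ramp = np.linspace(0.0, 1.0, 1024)                   # a smooth curve, dimension 1
print(round(box_dimension(ramp, [512, 256, 128, 64, 32, 16, 8, 4, 2]), 2))
```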

Thus, in our study, we had to use the highest resolution to approach infinitely small time
scales, and to estimate the possible fractality of vowels. To this end, we observed that
it is possible to appreciate, at least, the tendency of the set of points by calculating the slope
of the last 3 points, i.e. D of small size boxes (ssD). We demonstrated that:

•  in the case of a sinusoidal signal, for which Dgen = 1, ssD gives a better approximation of the
value of Dgen than using all points for the slope calculation;

•  in the particular case of speech, ssD is able to supply a kind of significant signature
of the vowel signal.

However, we noted that the accuracy of DyBC varies with the ratio [sampling frequency /
frequency of the tested signal]. We also observed that this accuracy depends on the
duration of the tested signal. Indeed several authors, e.g. Robinson [5], estimate that at
least 10 points are necessary to characterise the possible fractality of a physical object.
However, let N be the number of samples of the tested signal and p be the number of points on
the log-log graph, obeying $N = 2^p$; then one may observe that:

•  p drastically diminishes when N decreases

•  1024 samples - i.e. for example 64 ms if Fs = 16 kHz - supply only 10 points;
therefore these data characterise the shortest signal which may be studied by DyBC
with Fs = 16 kHz.

Consequently, as it is impossible to greatly increase Fs, DyBC is not suitable for the
measurement of the dimension D of short signals. Thus, we designed a semi-continuous
box counting method (SCBC) that makes it possible to approach the small scales in the case of
short single dimension temporal signals. In this study, we describe this SCBC. Then, testing
several mathematical signals, either with a known dimension or not fractal, we
compare its results in various cases to those obtained using the DyBC.

2  Methods  and  Material 


2.1 Measurement methods

Let Ts be the sample duration, i.e. 1/Fs. In this study, in order to first compare DyBC and
SCBC, we used the lowest values allowing the use of DyBC:

•  for Brownian signals: Fs=44.1 kHz, Sd=23 ms, N=1024

•  for sinusoidal signal: Fs=16 kHz, Sd=64 ms, N=1024

Then, in order to assess the efficiency of SCBC, we used various short duration signals
(0.36, 0.72 and 1.45 ms for Brownian signals, and 1, 2 and 4 ms for sinusoidal signal) and
compared the measured D with the theoretical D.

2.1.1  The  dyadic  box  counting  method 

This method used 10 boxes whose sizes vary from 23 ms (1024 Ts) to 0.045 ms (2 Ts) for
Brownian signals, and from 64 ms (1024 Ts) to 0.125 ms (2 Ts) for sinusoidal signal.

We measured the slope on the log-log graph of these 10 points, which include all
scales from 1024 Ts to 2 Ts (asD); we also studied the small scales from 8 Ts to 2 Ts,
measuring only the slope of the last 3 points (ssD) on the log-log graph.
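
The dyadic box sizes and the two slope measurements can be sketched as follows. The helper names `dyadic_box_sizes` and `asd_and_ssd` are illustrative, and the box counts passed to the second function are assumed to come from a separate counting step that is not shown here.

```python
import numpy as np

def dyadic_box_sizes(n_samples):
    """Box sizes in samples: n_samples, n_samples / 2, ..., 2."""
    sizes = []
    size = n_samples
    while size >= 2:
        sizes.append(size)
        size //= 2
    return sizes

def asd_and_ssd(box_sizes, counts, n_small=3):
    """Slope over all points (asD) and over the n_small smallest boxes (ssD)."""
    x = np.log(1.0 / np.asarray(box_sizes, dtype=float))
    y = np.log(np.asarray(counts, dtype=float))
    asd = np.polyfit(x, y, 1)[0]
    ssd = np.polyfit(x[-n_small:], y[-n_small:], 1)[0]
    return asd, ssd

sizes = dyadic_box_sizes(1024)
fs = 44100.0                                   # Brownian-signal sampling frequency
print(len(sizes), sizes[-3:])                  # number of points, small-scale boxes
print(1000.0 * sizes[0] / fs, 1000.0 * sizes[-1] / fs)   # durations in ms
```

With N = 1024 this gives the 10 points from 1024 Ts to 2 Ts, the last three being 8 Ts, 4 Ts and 2 Ts.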

2.1.2  The  semi-continuous  box  counting  method 

This method is directly derived from DyBC. It provides the first M points of the graph
through the dyadic division of boxes at large scales until a certain box size $S_M$ is reached, such that
$S_M = 2^M/Fs$. Then, at smaller scales, the box size decreases by 1/Fs for each successive
point, which provides the log-log graph with a large number of points. When the number of
boxes, $N/(S_{M+x} \cdot Fs)$, is not a whole number, we have to peripherally and symmetrically reduce
the analysed signal in the abscissa and the ordinate, so that a whole number is obtained.
These truncated samples are then reintroduced when designing the following boxes.
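
The symmetric reduction along the time axis can be sketched as below. It assumes that the number of samples set aside is simply the remainder of N divided by the box size, split as evenly as possible between the two ends; this reproduces the 25-sample box case described for Table I (12 samples dropped at each end of 1024), but it is an assumption and the reduction in ordinate is not modelled. The name `trim_symmetric` is illustrative.

```python
import numpy as np

def trim_symmetric(signal, box_samples):
    """Drop samples at both ends so the length is a multiple of box_samples."""
    extra = len(signal) % box_samples
    left = extra // 2                 # samples removed at the start
    right = extra - left              # samples removed at the end
    return signal[left: len(signal) - right]

signal = np.arange(1024)              # sample indices stand in for a real signal
trimmed = trim_symmetric(signal, 25)
print(len(trimmed), trimmed[0], trimmed[-1])
```

Because the trimming is applied afresh for each box size, samples set aside for one size are available again for the next.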

In the first part of this study, M=6, which yields a whole set of 36 points (Table
I). We also separately measured asD, which is supplied by the slope of the 36 points on the
log-log graph, and ssD, supplied by the slope of only the last 7 points, corresponding to
the small scales from 8 Ts to 2 Ts.
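
The sequence of box sizes can be generated as in the sketch below, which follows the worked case of N = 1024 and M = 6 (six dyadic sizes from 1024 down to 32 samples, then every size from 31 down to 2 samples). The function name `scbc_box_sizes` and the exact indexing of the dyadic part are assumptions made to match that example.

```python
def scbc_box_sizes(n_samples, m_dyadic):
    """Box sizes in samples: m_dyadic dyadic sizes, then a step of one sample."""
    dyadic = [n_samples >> k for k in range(m_dyadic)]
    # Continue one sample below the last dyadic size (or from N when M = 0).
    start = dyadic[-1] - 1 if dyadic else n_samples
    return dyadic + list(range(start, 1, -1))

sizes = scbc_box_sizes(1024, 6)
print(len(sizes))          # number of points on the log-log graph
print(sizes[:6])           # dyadic part
print(sizes[-7:])          # small scales used for ssD
print(len(scbc_box_sizes(16, 0)))
```

With M = 0 every size from N down to 2 samples is used, giving N - 1 points.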

Table I. Sizes of the 36 successive boxes of SCBC for the D measurement of a 64 ms signal sampled at a 16
kHz sampling frequency, with M=6. For each box size (i.e. duration), Bs-sn = box size in number of samples.

In this example a dyadic division of box size is performed to obtain the first 6 points on the log-log graph, i.e.
large scales, from 1024/Fs (64 ms) to 32/Fs (2 ms); then, for small scales, the box size is decreased in steps of
1/Fs (0.0625 ms) from 31/Fs (1.9375 ms) to 2/Fs (0.1250 ms). However, when the ratio of the number of samples
to the box size is not a whole number, the analysed signal is peripherally and symmetrically reduced in abscissa
and ordinate, so that a whole number is obtained. These truncated samples are then reintroduced in the
measurement of the following boxes. Of the 36 boxes of this example, the table shows only some, mainly: the values
of the last 10 boxes; the values of the box corresponding to the biggest reduction of the signal (24
samples); in this case, for instance, for the measurement of this box (1.5625 ms), the first 12 and last 12 samples
of the 1024 samples which constitute the tested signal have not been analysed.


Dyadic division decreasing

Box duration     Bs-sn
64 ms            1024
32 ms            512
16 ms            256
8 ms             128
4 ms             64
2 ms             32

1/Fs decreasing

Box duration     Bs-sn
1.9375 ms        31
1.8750 ms        30
1.8125 ms        29
...
1.5625 ms        25
...
0.6250 ms        10
0.5625 ms        9
0.5000 ms        8
0.4375 ms        7
0.3750 ms        6
0.3125 ms        5
0.2500 ms        4
0.1875 ms        3
0.1250 ms        2
0.0625 ms        1

Part of the signal which is not analysed with the box size Bs-sn (values in the order listed):
4 samples, 7 samples, 2 samples, 4 samples, 0 sample, 1 sample, 0 sample
In the second part of our study, using various short duration signals, M=0, which yields
15, 31 and 63 points for 0.36, 0.72 and 1.45 ms in the case of Brownian signals, and for 1,
2, and 4 ms in the case of sinusoidal signal. In these cases the slope of the log-log graph is based on
the last 10 points.

2.2  Tested  signals 

We used 4 mathematical signals. Three of them are fractal with a known dimension:
3 Brownian signals (D = 1.5, 1.4, and 1.3). The fourth signal is not fractal (1 kHz
sinusoidal signal), but its generalised dimension Dgen is known and equal to 1. For each
signal we randomly selected 24 trajectories, each of 64 ms duration. We used Matlab
software [6] to convert these sounds into 16-bit "wav" format and to analyse them.

Using  both  methods  we  measured  each  of  these  24  D  values  and  calculated  the 
corresponding  D  mean  value  and  confidence  intervals. 

2.3  Statistical  study 

Results were studied using the SPSS statistical package. Repeated-measures analysis of
variance, using statistical contrasts to perform pairwise comparisons, was used to
compare D measures.

Both DyBC and SCBC use the slope of the line obtained by linear
regression on the log-log graph to determine D. However, only one of the two
components is a variable, which represents the fluctuation in the signal. The other is a
regular function of time. Therefore, in order to quantify the possible bias, we calculated
the slope and the corresponding error, i.e. the difference between the observed ordinate
and the theoretical ordinate on the regression line.


3  Results 

Measurement of the slope and corresponding error showed that, for each linear
regression, the error was very small, ranging from $10^{-4}$ to $10^{-7}$.

Table II. Mean value (upper line) and confidence interval (lower line) of D measurement of 24 randomly
selected 64 ms duration parts of mathematical signals, using DyBC and SCBC, as a function of the size of the
studied scales (all sizes = asD, from 23 to 0.045 ms for Brownian signals and 64 to 0.1250 ms for sinusoidal
signal; only small sizes = ssD, from 0.18 to 0.045 ms for Brownian signal, and 0.5 to 0.125 ms for sinusoidal
signal).


                       DyBC - asD    SCBC - asD    DyBC - ssD    SCBC - ssD
                       10 boxes      36 boxes      3 boxes       7 boxes

Brownian signal        1.47          1.42          1.34          1.30
D=1.5                  1.45-1.48     1.40-1.43     1.33-1.34     1.30-1.31

Brownian signal        1.39          1.36**        1.27          1.25
D=1.4                  1.37-1.41     1.35-1.38     1.26-1.27     1.25-1.26

Brownian signal        1.30          1.28          1.21          1.25
D=1.3                  1.29-1.33     1.27-1.30     1.20-1.21     1.25-1.26

1 kHz Sinus. signal    1.83          1.85          1.16          1.15
Dgen=1                 1.83-1.84     1.85-1.85     1.09-1.23     1.11-1.18
3.1 Fractal signals


Results  are  summarised  in  Table  II. 

In the case of asD measurement, for the Brownian signal for which D = 1.5, the difference
between DyBC and SCBC is significant. However, neither of these methods includes the
theoretical D value in its confidence interval.

For Brownian signals for which D = 1.4 and 1.3, the difference between DyBC and
SCBC is not significant.

In the case of ssD measurement, the difference between DyBC and SCBC is not
significant for the Brownian signal for which D = 1.5. It is significant for Brownian signals
with D = 1.4 and 1.3. But the theoretical D value is never included in both methods'
confidence intervals, as the intervals do not overlap.

3.2  Non  fractal  signal 

In the case of asD measurement, the difference between DyBC and SCBC is not significant.
The calculated D is very different from Dgen.

In the case of ssD measurement, the difference between DyBC and SCBC is not
significant. But the calculated D tends to approach Dgen.

Table  III.  Mean  value  (upper  line)  and  confidence  interval  (lower  line)  of  D  measurement  of  24  randomly 
selected  parts  of  mathematical  signals  of  various  duration,  using  SCBC.  NAS  =  total  number  of  samples  which 
have  not  been  analysed. 


                       0.36 ms        0.72 ms        1.45 ms
                       16 Ts          32 Ts          64 Ts
                       15 points      31 points      63 points
                       NAS=43         NAS=229        NAS=920

Brownian signal        1.31           1.33           1.30
D=1.5                  1.27-1.34      1.31-1.35      1.28-1.32

Brownian signal        1.25           1.30           1.30
D=1.4                  1.23-1.27      1.28-1.32      1.28-1.30

Brownian signal        1.24           1.28           1.29
D=1.3                  1.23-1.26      1.27-1.29      1.28-1.31

                       1 ms           2 ms           4 ms
                       16 Ts          32 Ts          64 Ts
                       15 points      31 points      63 points
                       NAS=43         NAS=229        NAS=920

1 kHz Sinus. signal    1.32           1.19           1.15
Dgen=1                 1.31-1.33      1.18-1.20      1.14-1.15

3.3 Short duration signals

The measured D value progressively approaches the theoretical D as the duration of the
signal increases (Table III). For the Brownian signals with D = 1.4 and 1.3, the
differences observed between pairs of measurements are significant only for the
comparisons 0.36 ms vs 0.72 ms and 0.36 ms vs 1.45 ms. For the non fractal signal, the
difference is significant for all three comparisons: 1 ms vs 2 ms, 1 ms vs 4 ms and 2 ms vs 4 ms.
4 Discussion


4.1 Methodology

We could have studied fractal signals other than Brownian signals. In practice, however,
it is difficult to obtain mathematical signals that are known with certainty to be fractal;
this difficulty is evident if, for instance, we consider white noise.

The  value  of  M  is  important  and  we  shall  discuss  it  later. 


4.2 Efficiency of SCBC

4.2.1  Fractal  signals 

For the asD measurement, the efficiency of both DyBC and SCBC decreases as D (i.e. the
signal roughness) increases. This implies that the signal irregularities are not correctly
taken into account by the relatively narrow window that constitutes the tested signal
duration.

Moreover, for the ssD measurement, although the difference between SCBC and DyBC is
significant, neither method gives the correct D value. This is probably because, for this
particular kind of fractal signal, observing only the small scales does not capture the
long-range dependence of the Brownian signal.

4.2.2  Non  fractal  signal 

The asD values are very different from the true value of Dgen. That seems normal,
because the concept of Dgen is valid only at infinitely small scales.

Conversely, ssD tends towards the true value of Dgen, and there is no significant
difference between DyBC and SCBC.

4.2.3  Short  duration  signal 

If  we  consider  Table  III  and  the  SCBC-asD  column  of  Table  II,  and  recall  the 
significance  of  the  difference  between  the  various  pairs  of  measurements,  we  may 
observe  that  the  efficiency  of  SCBC  depends  on  the  signal  duration,  whatever  the  signal. 
In  case  of  Brownian  signal,  this  efficiency  also  inversely  depends  on  the  D  value.  These 
discrepancies  may  be  explained  by: 

•  the  truncation  of  the  Brownian  signal,  which  distorts  its  correlation  function  adding 
high  frequencies  in  its  spectrum, 

•  the  temporary  suppression  of  some  samples  during  the  SCBC  management.  This 
suppression  represents  a  bias;  this  bias  probably  increases  with  the  number  of 
samples  which  are  not  analysed,  and  also  with  the  reduction  in  signal  duration. 

Here the value of M must be discussed. In this preliminary study we arbitrarily
chose M=6 for the comparison between SCBC and DyBC, and M=0 for testing the
efficiency of SCBC on signals shorter than 64 ms. However, the fewer the samples
that are left unanalysed, the better the SCBC efficiency is likely to be. It could
be worthwhile to improve SCBC efficiency by choosing the M value as large as
possible, so as to keep only the 10 points necessary for the slope calculation.
Nevertheless, it must be emphasised that, for the non fractal signal, the value of Dgen is
correctly approached by SCBC.

4.2.4  ssD  significance 

The use of asD is efficient for signals with long correlations (e.g. Brownian signals),
while ssD is more efficient for signals whose dimension is observable at the smallest
scales (e.g. sinusoidal signals) or for signals that are not certain to have a fractal
dimension. Fractal dimension, however, is always defined as a limit, which implies that
the structure can be analysed at arbitrarily high resolution. This limit is reached only
for an infinite number of points, which is conceivable for a theoretical signal but
impossible for a natural signal. In practice, the structure of a physical object is given
by a finite data set representing its discrete digitisation. Thus, using only the last
points of the curve, i.e. the smallest scales, the dimension of an object which is fractal
is not measured exactly. But ssD may be considered as a kind of signature of the signal,
which may be characteristic [4] in some cases; the advantage of this estimation of ssD is
that it highlights the convergence to the generalised dimension of an object at the
smallest scales. That explains why, for the sinusoidal signal, we observed that the ssD
measurement tends towards the correct D value more rapidly (Figs. 1 to 4).
Figure 1. D values (ordinate) obtained using the box-counting method for several sinusoidal sounds as a function of
sampling frequency, using asD measurement.


Figure 2. D values (ordinate) obtained using the box-counting method for several sinusoidal sounds as a function of
sampling frequency, using ssD measurement.


Figure 3. D values (ordinate) obtained using the box-counting method for several sinusoidal sounds as a function of
sound frequency, using asD measurement.

Figure 4. D values (ordinate) obtained using the box-counting method for several sinusoidal sounds as a function of
sound frequency, using ssD measurement.


4.3  In  the  literature 

Other methods for determining D of a one-dimensional temporal signal may be found in the
literature. In our laboratory, we attempted to employ these methods [7] using the same
mathematical signals as in this paper, so that all these results may be compared.

4.3.1  The  Richardson  length  method  [8] 

This method measures the perimeter of an object with rulers of various lengths; when
log perimeter is plotted against log ruler length, a fractal object gives a straight line
with a negative slope S, and D = 1 - S. But this method is difficult to compute without
any risk of bias. Moreover, most signals must be digitised to be easily studied using
computers; consequently they are no longer continuous, but represent a series of data.
In any case, Pickover [9] demonstrated that this method gives the same results as the
box-counting method applied to the same digitised signal.
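
The relation D = 1 - S can be illustrated with a minimal sketch. It assumes that the ruler is approximated by stepping a fixed number of samples along the digitised curve, which is a common simplification and not necessarily the procedure used in [8] or [9]. The function names are hypothetical.

```python
import math

def fit_slope(xs, ys):
    """Least-squares slope of ys against xs."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sxx

def richardson_dimension(t, y, strides):
    """Estimate D = 1 - S for a digitised curve (t[i], y[i]).

    Assumption: for each stride k the ruler length is taken as the mean
    chord length between samples k apart, and the perimeter as their sum.
    """
    log_ruler, log_length = [], []
    for k in strides:
        chords = [math.hypot(t[i + k] - t[i], y[i + k] - y[i])
                  for i in range(0, len(t) - k, k)]
        log_ruler.append(math.log(sum(chords) / len(chords)))
        log_length.append(math.log(sum(chords)))
    return 1.0 - fit_slope(log_ruler, log_length)

# Usage: a straight line is not fractal, so the estimate should be close to 1.
n = 1025
t = [i / (n - 1) for i in range(n)]
line = [2.0 * ti for ti in t]
d_line = richardson_dimension(t, line, [1, 2, 4, 8, 16, 32, 64])
```

The strides are chosen to divide the number of intervals exactly, so that every ruler covers the same portion of the curve.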

4.3.2  The  power  spectrum  density  (PSD)  measurement 

This  technique  has  been  employed  [10]  using  the  Fourier  transform  generated  spectral 
density  function,  which  gives  intensity  or  power  at  each  frequency.  The  results 
demonstrate  that  this  method  is  valuable  for  sounds  with  PSD  in  1/f,  as  Brownian  signals 
for  instance.  But  it  is  not  usable  for  signals  analysis,  which  have  not  PSD  in  1/f. 
4.3.3 The rescaled range analysis (RRA)

This is the earliest empirical method. It was described by Hurst to study the long
time series of Nile floods. It is based on the assumption that a large number of natural
phenomena are time series with a long-term correlation. This method was recently studied
by Bassingthwaighte et al [11]. It characterises a one-dimensional time series by
simultaneously providing a measure of the variance and of the long-term correlation of its
components (the term "memory" is often used). But the results obtained with signals
whose PSD is not a power law, or which do not have (even theoretically) a fractal
dimension, are totally incoherent. This is the case for a constant or a sinusoidal signal.
RRA only appeared to be effective for signals with a very long correlation, or a power
spectrum in 1/f. However, even in these cases, application to known Brownian signals
underestimated the true H for H>0.72, and overestimated H for H<0.72.
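
The rescaled range statistic itself is simple to compute. The following is a minimal sketch of the classical R/S calculation, not the exact procedure evaluated in [11]; the function names and the choice of non-overlapping windows are assumptions made for illustration.

```python
import math

def rescaled_range(x):
    """Return R/S for one window of a time series.

    R is the range of the cumulative sum of deviations from the window mean,
    S is the population standard deviation of the window. A constant window
    gives S = 0 and is therefore not handled here.
    """
    n = len(x)
    mean = sum(x) / n
    cumulative, total = [], 0.0
    for value in x:
        total += value - mean
        cumulative.append(total)
    r = max(cumulative) - min(cumulative)
    s = math.sqrt(sum((value - mean) ** 2 for value in x) / n)
    return r / s

def hurst_from_windows(x, window_sizes):
    """Slope of log(mean R/S) against log(window size), an estimate of H."""
    logs_n, logs_rs = [], []
    for n in window_sizes:
        windows = [x[i:i + n] for i in range(0, len(x) - n + 1, n)]
        mean_rs = sum(rescaled_range(w) for w in windows) / len(windows)
        logs_n.append(math.log(n))
        logs_rs.append(math.log(mean_rs))
    mx = sum(logs_n) / len(logs_n)
    my = sum(logs_rs) / len(logs_rs)
    num = sum((a - mx) * (b - my) for a, b in zip(logs_n, logs_rs))
    den = sum((a - mx) ** 2 for a in logs_n)
    return num / den

# Usage on a tiny window: cumulative deviations are -1.5, -2.0, -1.5, 0.0.
rs_small = rescaled_range([1.0, 2.0, 3.0, 4.0])
```

The division by S is the reason a constant signal cannot be analysed at all, which is consistent with the incoherent results reported above for such signals.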

4.3.4  The  dispersional  analysis  method  (DAM) 

This method is also empirical. It was recently studied by Bassingthwaighte et al [12] to
evaluate how effectively it determines the fractal dimension of a Brownian signal,
according to the type of this signal and the size of the data set. DAM measures the
standard deviation of the one-dimensional signal at different scales. The objective is to
reveal a possible power law in the successive values of the standard deviation of the
signal. But for signals whose PSD is not a power law, or which do not have a
fractal dimension, the results are similar to those obtained with RRA. Consequently,
despite the fact that the evaluation of the true H of Brownian signals is more precise than
with RRA, one may conclude that all these methods are less efficient than DyBC or
SCBC.
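
As an illustration of "standard deviation at different scales", the sketch below averages the signal over groups of m consecutive samples and fits the power law of the standard deviation of those group means. It assumes the usual dispersional relation SD(m) proportional to m^(H-1), which is not stated in this paper, and it is applied to the increments (noise) rather than to the Brownian trace itself. The function name is hypothetical.

```python
import math
import random
import statistics

def dispersional_hurst(x, group_sizes):
    """Estimate H from the scaling of the SD of group means.

    Assumption: SD(m) ~ m**(H - 1), so H = slope + 1 on a log-log plot.
    """
    logs_m, logs_sd = [], []
    for m in group_sizes:
        means = [sum(x[i:i + m]) / m for i in range(0, len(x) - m + 1, m)]
        logs_m.append(math.log(m))
        logs_sd.append(math.log(statistics.pstdev(means)))
    mx = sum(logs_m) / len(logs_m)
    my = sum(logs_sd) / len(logs_sd)
    num = sum((a - mx) * (b - my) for a, b in zip(logs_m, logs_sd))
    den = sum((a - mx) ** 2 for a in logs_m)
    return num / den + 1.0

# Usage: uncorrelated Gaussian noise should give H close to 0.5.
rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(8192)]
h_noise = dispersional_hurst(noise, [1, 2, 4, 8, 16, 32, 64])
```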

4.3.5  Other  tools 

Although the aim of this paper is not the study of the possible fractal features of speech,
one may briefly mention other tools proposed by several authors who use fractal
geometry as a model for describing the irregularities of graphical waveforms of human
speech. Bohez et al [13] used the box-counting method and cluster analysis; however, the D
values were not clearly specified. Using different box shapes and a specific
box-counting algorithm, Maragos [14] described different D values (1 to 1.3) for vowels as a
function of the scale of analysis, and his results are in accordance with our first
measurements [4]. Other authors studied the multifractal and chaotic features of speech,
but we shall not discuss them in this paper, which only considers a fractal approach to
one-dimensional, short temporal signals.


5  Conclusion 

Because SCBC increases the number of points at small scales, it improves on the
performance of DyBC, pushing back the physical limits due to the sampling frequency and
the duration of the tested signal. However, its efficiency presents comparable limits:

•  some are physical: for example, with Fs=16kHz, since at least 10 points on the
log-log graph are necessary, the shortest signal which may be analysed using SCBC
must have a minimum duration of 0.6875 ms.

•  others are due to the principle of the method, which leads to some samples not
being analysed. However, choosing the M value as a function of Sd, so as to reduce the
number of points used for the slope calculation to 10 on the log-log graph, could
improve SCBC efficiency.
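
The 0.6875 ms figure can be checked with a short calculation. The sketch assumes that n points on the log-log graph require n + 1 sampling periods Ts, which is consistent with Table III (15 points for 16 Ts); the function name is hypothetical.

```python
def min_duration_ms(n_points, fs_hz):
    """Shortest analysable duration in ms for n_points on the log-log graph.

    Assumption: n_points requires (n_points + 1) sampling periods.
    """
    n_periods = n_points + 1
    return 1000.0 * n_periods / fs_hz

shortest = min_duration_ms(10, 16000)      # 11 Ts at 16 kHz
table_entry = min_duration_ms(15, 16000)   # 16 Ts at 16 kHz, as in Table III
```

With Fs = 16 kHz, Ts = 62.5 µs, so 11 Ts gives 0.6875 ms and 16 Ts gives the 1 ms column of Table III.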

We plan to study this possible improvement further. We shall then use this method to
explore the possible dimension D of transient parts of consonants.
A FRACTIONAL BROWNIAN MOTION MODEL OF CRACKING

An attempt is made to find the fractal cutoff of crack profiles on the tension face of concrete beams
subjected to uni-axial bending. Previous work by the authors has shown that such cracking can be
interpreted as a non-Fickian diffusive phenomenon resulting from a self-affine random fractal
process: specifically fractional Brownian motion (fBm). In addition, a spatial description of the
cracking geometry can be found from experimental data using both a (Hurst) scaling exponent and a
diffusion-type coefficient. Herein the authors find that the fractal description of the crack profiles
extends down to less than 0.75 μm. The use of a scanning electron microscope to probe the crack
profile (and surface) at smaller scales is discussed and the synthesis of crack surfaces using fBm is
described briefly.


1  Introduction 

An  understanding  of  the  behaviour  of  cracking  in  structural  elements  is  of  great 
importance  in  the  analysis  and  subsequent  safe  design  of  engineering  structures.  As  yet, 
however,  there  is  no  definitive  theoretical  framework  for  the  propagation  of  cracks  and 
resulting  fracture  energy.  It  has  recently  been  found  that  the  irregular  geometries  of  both 
crack  surfaces  and  crack  profiles  in  a  variety  of  materials  may  be  described  (and 
subsequently  modelled)  using  fractal  geometry.  The  use  of  fractal  geometry  to  describe 
cracking phenomena is now widespread (e.g. see references 1-4). Previous work by the
authors5,6  has  shown  that  crack  profiles  on  concrete  beams  in  tension  can  be  modelled  as 
fBm  trace  functions  which  require  a  Hurst  exponent  and  a  spatial  diffusion  coefficient  to 
completely  describe  the  spatial  distribution  of  the  crack.  In  addition,  the  authors  have 
linked  the  fBm  description  of  the  cracking  phenomena  to  an  effective  Fokker-Planck 
equation  which  described  the  diffusive  nature  of  the  cracking  phenomena  through  space. 
In  this  paper  the  cracks  are  investigated  at  higher  resolutions  in  an  attempt  to  determine 
whether  a  fractal  cut-off  scale  exists.  The  value  of  such  a  Euclidean  threshold  is  of 
significant  importance  in  the  determination  of  the  energy  of  fracture. 


2  The  Diffusive  Nature  of  fBm 

Fractional  Brownian  Motion  (fBm)  is  a  generalisation  of  Brownian  motion  suggested  by 
Mandelbrot7  which  has  found  a  variety  of  uses  in  the  natural  sciences  (see  for  example 
Addison  and  Ndumu8  and  the  references  contained  therein).  Fractional  Brownian  motion 
is  defined  as: 


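\[
y(x) = \frac{1}{\Gamma\left(H+\frac{1}{2}\right)}
\left\{ \int_{-\infty}^{0} \left[ (x-x')^{H-\frac{1}{2}} - (-x')^{H-\frac{1}{2}} \right] dW(x')
+ \int_{0}^{x} (x-x')^{H-\frac{1}{2}} \, dW(x') \right\}
\tag{1}
\]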
where dW(x) is a Gaussian random function with zero mean and unit standard deviation, H
is the Hurst exponent9, and Γ is the gamma function. When H = 0.5, Eq. (1) models classical
Brownian motion which produces normal, or Fickian, diffusion. From Eq. (1) it may be
seen that the fBm process is correlated over all length scales, i.e., it has an infinite
memory associated with it.


Figure 1: Diffusive scaling of an fBm (H = 0.75, Kf = 10)

An example of a superdiffusive fBm (i.e. one with H > 0.5) is shown in Fig. 1. The
diffusive scaling of the fBm process shown in Fig. 1 may be defined as

\[
\sigma_y = \sqrt{2 K_f} \, s^{H}
\tag{2}
\]

where σy is the standard deviation of the y-excursions (Δy) on the trace for a window
length s; Kf is a fractal diffusion coefficient. Eq. (2) is in fact the standard deviation of the
probability density function


\[
P(y,x) = \frac{1}{\sqrt{4 \pi K_f x^{2H}}} \exp\left( -\frac{y^{2}}{4 K_f x^{2H}} \right)
\tag{3}
\]


which is a non-Fickian scaling of a Gaussian probability density function through space.
(If H = 1/2 then Eq. (3) reduces to the solution of a Fickian based diffusion from a point
source.) Furthermore, it has been shown by Wang and Lung10 that Eq. (3) is the solution
to the effective Fokker-Planck equation:


\[
\frac{\partial P(y,x)}{\partial x} = 2 H K_f \, x^{2H-1} \, \frac{\partial^{2} P(y,x)}{\partial y^{2}}
\tag{4}
\]


which describes the probability of occurrence of y(x) at spatial location x. Eq. (4) is in fact
a generalisation of the classical Fickian diffusion equation with a spatial diffusion
coefficient. The equation reduces to the classical equation for H = 0.5.
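
That the Gaussian density of Eq. (3) satisfies Eq. (4) can be checked numerically. The sketch below is an illustration only: it assumes Eq. (3) is a zero-mean Gaussian in y with variance 2 Kf x^(2H), as implied by Eq. (2), and Eq. (4) has a diffusion coefficient 2 H Kf x^(2H-1). The function names, the test point and the finite-difference step are arbitrary choices.

```python
import math

def pdf(y, x, hurst, k_f):
    """Gaussian density of Eq. (3), with variance 2*K_f*x**(2H)."""
    var = 2.0 * k_f * x ** (2.0 * hurst)
    return math.exp(-y * y / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

def fokker_planck_residual(y, x, hurst, k_f, step=1e-3):
    """Left side minus right side of Eq. (4), by central finite differences."""
    dp_dx = (pdf(y, x + step, hurst, k_f) - pdf(y, x - step, hurst, k_f)) / (2.0 * step)
    d2p_dy2 = (pdf(y + step, x, hurst, k_f)
               - 2.0 * pdf(y, x, hurst, k_f)
               + pdf(y - step, x, hurst, k_f)) / step ** 2
    return dp_dx - 2.0 * hurst * k_f * x ** (2.0 * hurst - 1.0) * d2p_dy2

# Usage: the residual should vanish up to discretisation error.
residual_super = fokker_planck_residual(0.3, 2.0, 0.75, 0.084)
residual_fick = fokker_planck_residual(0.3, 2.0, 0.5, 0.084)
```

For H = 0.5 the coefficient 2 H Kf x^(2H-1) reduces to the constant Kf, which is the classical Fickian case mentioned above.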




It can be seen from the above that, over a large number of realisations, fBm
approximates a non-Fickian diffusive process described by Eq. (4). The authors have
previously shown that persistent fBm (H > 0.5) is a suitable model for cracking on the
tension face of a concrete beam in bending5. In addition, both H and Kf are required for a
complete geometric description of the cracking phenomena. It was shown by the authors
how these parameters can be found from experiment. The mean values of Kf and H were
found for a series of flexure cracks in concrete beams to be 0.084 and 0.77, respectively
(i.e. superdiffusive surfaces with fractal dimensions between 1 and 1.5). This gives the
standard deviation of the cracking across the beam as
\( \sigma_y = \sqrt{2 \times 0.084} \, s^{0.77} \), where both s and σy are expressed in
millimetres. Thus, for the 40 mm wide specimens used in the study the
expected standard deviation of the crack displacement across the beam is 7.02 mm. These
experimentally derived parameters can be used to synthesise crack patterns using fBms.
An example of this is shown in Fig. 2 using the fBm generation method described by
Addison et al.11. In the figure a crack with measured values of H and Kf of 0.75 and 0.133
respectively is shown together with a synthesised crack with the same parameter values.
The similarity between the two traces is evident from a visual inspection of the plot.
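
The quoted value of 7.02 mm follows directly from the scaling law of Eq. (2), read as σy = sqrt(2 Kf) s^H. A minimal sketch of the arithmetic, with a hypothetical function name:

```python
import math

def crack_std_mm(window_mm, hurst, k_f):
    """Standard deviation of crack excursions from sigma_y = sqrt(2*K_f) * s**H."""
    return math.sqrt(2.0 * k_f) * window_mm ** hurst

# Mean parameters reported for the flexure cracks, 40 mm wide specimens.
sigma_40 = crack_std_mm(40.0, 0.77, 0.084)
```

Here sqrt(2 × 0.084) is about 0.41 and 40^0.77 is about 17.1, whose product rounds to 7.02 mm.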


Figure  2:  Comparison  of  synthesised  and  experimental  cracks 


3  Crack  Profile  Analysis 

Natural  fractals  tend  to  exhibit  fractal  characteristics  over  a  limited  range  of  scales12. 
Below  a  cut-off  level,  the  natural  fractal  object  tends  to  revert  to  a  Euclidean  form.  The 
authors  have  recently  pursued  the  search  for  the  cut-off  length  scale  in  the  crack  patterns 
as  it  has  implications  for  the  measurement  of  the  true  areas  of  crack  surfaces  and  hence 
energy  dissipation  across  the  surface.  Fig.  3  shows  one  of  the  cracks  studied  by  the 
authors at a magnification level of 6×. Seven boxes are placed on the crack profile
indicating regions where a closer inspection was taken of the crack profile at the higher
magnification of 40×. In addition, two smaller boxes indicate locations where the crack
was studied at 50× and 100× magnification, respectively. Fig. 4 contains a log-log plot of
σy against s. From such a plot it is possible to calculate both H and Kf. A line of slope
H = 1 is also given in the plot, corresponding to a dimension of unity, i.e. a smooth
Euclidean  curve.  It  would  be  expected  that  the  plotted  curves  tend  to  this  slope  at  the 
Euclidean  cut-off. 
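
Reading H and Kf from such a plot amounts to a straight-line fit: taking logarithms of σy = sqrt(2 Kf) s^H gives ln σy = H ln s + 0.5 ln(2 Kf). The sketch below assumes natural logarithms and an ordinary least-squares fit; the authors' actual fitting procedure is not described here, and the function name and synthetic data are illustrative.

```python
import math

def fit_hurst_and_kf(window_lengths, sigmas):
    """Fit ln(sigma) = H*ln(s) + 0.5*ln(2*K_f) and return (H, K_f)."""
    xs = [math.log(s) for s in window_lengths]
    ys = [math.log(v) for v in sigmas]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    intercept = my - slope * mx
    k_f = math.exp(2.0 * intercept) / 2.0
    return slope, k_f

# Usage: synthetic, noise-free data generated with H = 0.74 and K_f = 0.069.
windows = [0.5, 1.0, 2.0, 4.0, 8.0, 16.0]
sigmas = [math.sqrt(2.0 * 0.069) * s ** 0.74 for s in windows]
h_est, kf_est = fit_hurst_and_kf(windows, sigmas)
```

Because Kf enters through the exponential of twice the intercept, small errors in the intercept are amplified in Kf, whereas H depends only on the slope.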
Figure 3: Crack profile showing analysed regions


Figure 4: Logarithmic plots of σy versus s for the crack shown in Figure 3. The graph shows (extending out,
bottom left) two sections representing 50× and 100× magnifications.


Table  1  contains  the  H  and  Kf  values  for  the  various  regions  of  the  crack  in  Fig.  3 
measured  from  Fig.  4.  It  can  be  seen  from  the  table  that  measured  H  values  for  the  whole 
crack  and  the  average  values  from  the  seven  boxes  A  to  G  are  in  good  agreement. 
However,  the  Kf  values  between  the  two  are  significantly  different.  In  fact  the  Kf  values 
vary over a large range (0.004-0.118) across the selected boxes. The reason for this
variability is as yet unclear. Zooming in at 50× and 100× again produces persistent values
of the Hurst exponent. It can be seen from the plot that the fractal description of the curves
extends down to the lower limits of the 100× magnification: this relates to a resolution of
7.5 × 10^-4 mm of crack per pixel.
Table 1: The Measured H and Kf Values for the Crack in Figure 3.


Region              H       Kf
Whole crack (6×)    0.74    0.069
A (40×)             0.87    0.093
B (40×)             0.64    0.004
C (40×)             0.68    0.008
D (40×)             0.61    0.018
E (40×)             0.72    0.015
F (40×)             0.82    0.118
G (40×)             0.83    0.091
Average (A-G)       0.74    0.050
50×                 0.71    0.025
100×                0.74    0.030

4  Concluding  Remarks 

From Fig. 4, the lower limit to the fractal behaviour of the cracking patterns (if it exists!)
appears to be below 0.75 μm. This is significantly less than the value between 10 and 20
μm (which is the approximate size of calcium silicate hydrate) suggested by Saouma and
Barton4. The authors have recently initiated research to search for a fractal cut-off scale at
higher  resolutions  at  the  crack  edge  using  a  scanning  electron  microscope  (e.g.  Fig.  5). 
This  work  has  so  far  proved  inconclusive  due  to  the  difficulty  in  finding  reasonable 
vertical  sections  through  the  crack  edge  at  these  higher  scales.  It  is  hoped  that  an 
improvement  in  the  experimental  techniques  will  lead  to  a  better  understanding  of  the 
crack  geometry  at  these  smaller  scales. 


Figure 5: An electron microscope image of the crack profile at 2000×
magnification. Note the difficulty in defining the edge of the crack.
The determination of the true fractal cut-off scale has implications for the synthesis of
prefractal fBm crack profiles and surfaces and hence the calculation of crack energy
across the crack surface. If the crack surface is in fact fBm, then the crack profile may be
treated as the result of a vertical cut through the surface13. If this proves to be the case,
then it is known that the fractal dimension of the surface, D_surface, is equal to D_profile + 1. It is
relatively  simple  to  generate  such  a  surface  using  a  variety  of  methods.  It  should  be 
possible  therefore  to  synthesise  the  crack  surface  using  the  Kf  and  H  values  found  from 
experiment.  Fig.  6  shows  an  fBm  surface  generated  using  the  turning  bands  method14.  The 
authors  intend  to  pursue  the  measurement  and  synthesis  of  crack  profiles  and  surfaces  in 
order  to  define  the  fracture  energy  of  cracking  in  terms  of  a  fractal  geometric  framework 
based  on  fBms. 




Figure  6:  An  fBm  surface  generated  using  the  turning  bands  method  (H=0.8) 
Applying the variation-correlation method to images of fractography obtained from the scanning electron
microscope  (SEM),  it  has  been  found  that  there  exists  a  fractal  characteristic  length  within  which  the 
fractured  surfaces  are  fractal.  Investigation  shows  that  the  fractal  characteristic  length  can  represent 
the  statistical  maximum  size  of  texture  of  the  SEM  image.  Multi-magnification  fractal  analysis  has 
shown  that  fractography  cannot  be  described  by  a  single  fractal  dimension  but  rather  a  series  of 
fractal  dimensions.  Fractal  study  on  the  fractured  surfaces  of  gray  iron  with  different  grain  size  has 
shown  that  there  exists  a  positive  relationship  between  tensile  strength  and  fractal  dimension. 
However,  no  essential  relationship  between  impact  toughness  and  fractal  dimension  for  HP26Cr35Ni 
alloy  could  be  obtained.  Fractal  dimension  is  sensitive  to  changes  of  mechanical  properties  caused  by 
geometric  factors  such  as  grain  size  and  is  suitable  to  quantitatively  describe  the  irregularity  of 
fractography.  For  a  given  fracture  mode,  fractal  dimension  is  not  universal. 


1  Introduction 

Fracture  phenomenon  is  one  of  the  most  intriguing  areas  in  material  science  and 
engineering  because  it  is  associated  with  the  life  and  safety  of  engineering  parts.  There  is 
a  lot  of  information  related  to  fracture  mechanism  in  fractography.  Fractal  based  studies 
on  fractography  become  important  because  of  the  problem  of  variance  of  conventional 
quantitative  parameters  with  scales.  During  the  last  twenty  years,  many  fractal-based 
methods  for  quantitative  analysis  on  fractography  have  been  developed.  The  methods  can 
be categorized into the following three classes: (a) slit island analysis [1]; (b) profile
analysis; and (c) 3-dimensional (3D) surface analysis. Among them, 3D surface
analysis is the most interesting. The fractal characteristics of fracture surfaces and the
relationship between fractal parameters and mechanical properties have been investigated
using these methods. Some results indicate that fractal dimension can be a measure of the
toughness of a material [1, 9, 10]. On the other hand, some results have shown the existence
of a universal value of fractal dimension for a given fracture mode. It has been
conjectured that the value of the fractal parameter α (α = 3 - D, where D is the fractal
dimension) is 0.8 for ductile fracture [11] and 0.87 for brittle fracture [3]. These values
imply that there is no correlation between fractal dimension and toughness, because a
universal value denies any dependence of fracture features on a specific toughness. In this paper, a new 3D
surface  model  based  on  variation  profile  analysis  has  been  introduced.  Using  this  model, 
the  fractal  characteristics  of  different  fracture  modes  and  the  correlation  between  fractal 
dimension  and  mechanical  properties  have  been  studied. 


2  Experimental  Procedures 

A  typical  ductile  fracture  of  microvoid  coalescence  was  obtained  by  tensile  testing  a 
specimen  of  Assab  760  medium  carbon  steel.  A  transgranular  cleavage  mode  was 
obtained  by  fracturing  a  Charpy  V-notch  specimen  of  the  same  steel  after  immersing  it  in 
liquid nitrogen for 60 minutes. Fracture of a pure Al2O3 ceramic specimen appeared to
be of brittle intergranular mode. To analyze the influence of geometric and non-geometric
factors on the fractal dimension of fractography, specimen A of gray iron was modified with
0.4 wt% 75SiFe inoculation, while specimen B was modified with 0.3 wt% FeSiCuMo metal
stream inoculation to improve the graphite-eutectic microstructures. Tensile strength σb was
obtained by tensile testing the various materials. A series of specimens of HP26Cr35Ni
alloy with the size 4×4×35 mm were taken from the same regions of solidification. Thermal
shocks before mechanical testing were carried out 0, 5, 10, 15 and 20 times. In the thermal
shock, the specimens were heated to 1100°C for 10 minutes followed by water-quenching.
After the thermal shock, impact toughness was obtained by impact testing without Charpy
V-notch. The nominal compositions of these materials are listed in Table 1. All fracture
surfaces were imaged by SEM followed by digitizing.


Table  1.  Composition  of  materials  investigated  in  wt%. 


Materials     C       Si      Mn      P        S        Cr     Ni     Fe
760 steel     0.5     0.25    0.5     -        -        -      -      Bal.
Gray iron     3.0     0.12    0.7     <0.3     <0.05    -      -      Bal.
HP26Cr35Ni    0.55    2.5     2.0     <0.04    <0.04    26     35     Bal.

3  Variation-Correlation  Method 


The variation method was introduced by Dubuc et al [4]. In this method, an
ε-oscillation for the height function h(x) of a profile, v(x, ε), is defined as follows:

\[
v(x,\varepsilon) = \max_{|x'-x|<\varepsilon} h(x') - \min_{|x'-x|<\varepsilon} h(x')
\tag{1}
\]

The integral V(ε) may be defined as:

\[
V(\varepsilon) = \int v(x,\varepsilon) \, dx.
\tag{2}
\]

V(ε) is termed the ε-variation. The limit of V(ε) is zero for ε → 0 due to the continuity of
the function h(x). The rate at which V(ε) scales when ε → 0 is:

\[
V(\varepsilon) \propto \varepsilon^{2-D},
\tag{3}
\]

which provides a method to calculate the fractal dimension D. The variation method can
be interpreted as a numerical technique of box counting. It has been demonstrated to be
more accurate than any other technique for fractal analysis of self-affine curves [4,5].
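
A discrete version of the variation method can be sketched as follows. This is an illustration under stated assumptions, not the implementation of [4]: the profile is sampled at unit spacing, ε is measured in samples, and V(ε) is approximated by the mean oscillation over interior points only, which differs from the integral by a constant factor that does not affect the slope. The function name is hypothetical.

```python
import math

def variation_dimension(h, half_widths):
    """Estimate D from V(eps) ~ eps**(2 - D) for a sampled profile h."""
    k_max = max(half_widths)
    log_eps, log_var = [], []
    for k in half_widths:
        # Oscillation: local max minus local min in a window of half-width k.
        osc = [max(h[i - k:i + k + 1]) - min(h[i - k:i + k + 1])
               for i in range(k_max, len(h) - k_max)]
        log_eps.append(math.log(k))
        log_var.append(math.log(sum(osc) / len(osc)))
    mx = sum(log_eps) / len(log_eps)
    my = sum(log_var) / len(log_var)
    slope = (sum((a - mx) * (b - my) for a, b in zip(log_eps, log_var))
             / sum((a - mx) ** 2 for a in log_eps))
    return 2.0 - slope

# Usage: a straight profile has oscillation proportional to eps, hence D = 1.
profile = [0.5 * i for i in range(256)]
d_profile = variation_dimension(profile, [1, 2, 4, 8, 16])
```

The same set of interior points is used for every ε so that all scales are averaged over an identical domain.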

Based on the variation method, a variation-correlation model for 3D surfaces has been
developed [8]. In this model, it is assumed that H(x,y) is a dimensionless field-like variable
of a surface which may denote the distribution of height or angle. If H_max(x,y,ε) and
H_min(x,y,ε) are the local maximum and minimum of H(x,y) in an ε-neighborhood
respectively, a variation-correlation function V_cor(ε) is defined as follows

\[
V_{cor}(\varepsilon) = \left\langle \left[ H_{max}(x,y,\varepsilon) - H_{min}(x,y,\varepsilon) \right]^{2} \right\rangle
\tag{4}
\]
The symbol <...> refers to two consecutive averaging operations. If the
variation-correlation function V_cor(ε) of the variable H(x,y) on a surface obeys a power law in
some significant range of scales ε_min < ε < ε_max, i.e.,

\[
V_{cor}(\varepsilon) \propto \varepsilon^{2\alpha}
\tag{5}
\]

where the fractal parameter α = 3 - D_cor, the variation-correlation is called a fractal
variation-correlation function.


4  Results  and  Discussion 


4.1 Fractal Characterization of Different Fracture Modes

The typical fracture modes of microvoid coalescence, transgranular cleavage and brittle
intergranular fracture are shown in Fig. 1 (a), (b) and (c). Fig. 1 (d), which gives the V_cor-ε
curves of these fracture surfaces, shows that the variation-correlation appears as lateral
growth only within the range ε < ε_max. When ε > ε_max, the V_cor-ε curves appear to reach a
plateau value. These observations imply that the following short- and long-range
relationships of variation-correlation versus scale are true

\[
V_{cor}(\varepsilon) \propto
\begin{cases}
\varepsilon^{2\alpha}, & \varepsilon \ll \varepsilon_{max} \\
\text{const}, & \varepsilon \gg \varepsilon_{max}
\end{cases}
\tag{6}
\]


From Eq. (6), the log-log plot of V_cor versus ε should be a straight line in the short range
of scales, with a slope different from that in the long range of scales. To estimate
ε_max, a linear measure M_F is introduced as follows


where δ is the degree of dispersion expressing the fluctuation of the set of points (x_i, y_i) from
the fit line. δ is defined as


where 


jr<>,  -x)(y,  - y ) 


I  O', -yr 
- 

i= i 

Fig. 1. SEM fractography and fractal characteristics. (a) Microvoid fracture; (b) transgranular cleavage fracture; (c) brittle intergranular fracture; (d) variation-correlation versus scale curve based on variation-correlation analysis; (e) height-correlation versus scale curve based on height-height correlation analysis. The resolution of the images is 68.75 µm/200 pixels.

The value of $M_F$ falls in the range between 0 and 1. When $M_F$ tends to 1, the distribution of the points $(x_i, y_i)$ is perfectly linear. When $M_F$ tends to 0, the distribution becomes non-linear. Table 2 shows the values of $D$ and $\varepsilon_{max}$ estimated with $M_F = 0.7$, while the correlation coefficient $r$ of the log-log plot is over 0.98, indicating that $M_F$ is a better linear measure than $r$.


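Table 2. The estimation of fractal parameters and the maximum size $d_{max}$ of the basic micro-morphology element on the fractography

Columns 2 to 5: variation-correlation analysis. Columns 6 to 9: height-height correlation analysis.

| Fractography | $D_{cor}$ | $\varepsilon_{max}$ | $r$ | $M_F$ | $D$ (height-height) | $\varepsilon_{max}$ | $r$ | $M_F$ | $d_{max}$ |
| Microvoid type | 2.58 | 64 | 0.984 | 0.7 | 2.68 | 5 | 0.982 | 0.7 | 75 |
| Transgranular type | 2.54 | 17 | 0.983 | 0.7 | 2.58 | 5 | 0.972 | 0.7 | 31 |
| Intergranular type | 2.44 | 26 | 0.983 | 0.7 | 2.33 | 10 | 0.982 | 0.7 | 35 |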

The above results confirm that, for the fractography of a real material, there exists a characteristic length $\varepsilon_{max}$ such that the log-log plot of the variation-correlation is linear within $\varepsilon < \varepsilon_{max}$. Unlike an ideal self-affine profile, for example the Weierstrass-Mandelbrot curve, whose $V_{cor}$ appears as lateral growth at all possible scales, the fracture morphologies of real materials are made up of basic micro-morphology elements such as dimples, cleavage facets or boundary facets. These basic micro-morphology elements are affine only over a limited range of scales and together form the texture of the SEM images. Assuming that $V_{max}$ is the expected value of the maximum variation of the texture at all possible scales, $V_{max}$ is bounded within a scale range ($\varepsilon < \varepsilon_{max}$) because of the affinity and periodicity of the fractography textures. Therefore, this characteristic length is associated with the statistical maximum size of the fractography textures. When the measuring scale $\varepsilon$ is greater than $\varepsilon_{max}$, the largest and smallest values on the fractography are always included, resulting in $V_{cor}(\varepsilon) = V_{max}$.

From the data shown in Table 2, the value of $D_{cor}$ is largest for ductile (microvoid) fracture and smallest for brittle intergranular fracture. These results agree well with the irregularity of the fracture surfaces. Fig. 1 shows that some small dimples are located inside the larger ones. The fractured surface experiences a large amount of ductile deformation, and hence dimples at different scales are formed. Such a fracture surface has multi-scale similar structures, leading to a large $\varepsilon_{max}$, within which the variation-correlation follows a lateral growth with increasing scale. It must be noted that a large $\varepsilon_{max}$ does not necessarily mean multi-scale similar structures, because it is a parameter related to the maximum size of the surface texture. For the brittle intergranular fracture, the fractography consists of smooth boundary facets, resulting in a decrease in $D_{cor}$. The irregularity of the transgranular cleavage fracture is intermediate, in spite of the rather rugged surface, because the surface of each grain is smooth.

For the sake of comparison, the curves of height correlation versus scale obtained by height-height correlation analysis [12,15] are also shown in Fig. 1 (e). The height-height correlation function $C(\varepsilon)$ is defined as

$$C(\varepsilon) = \left\langle h(\varepsilon) - h(0) \right\rangle$$

where $h(0)$ is the height of the reference point and $h(\varepsilon)$ the height of the points at distance $\varepsilon$ from the reference point. The estimated values of the fractal parameters are shown in Table 2. Fractal dimensions from the height-height correlation show the same tendency as those from the variation-correlation in characterizing the irregularity of the fracture surfaces. However, the characteristic length $\varepsilon_{max}$ in the height-height correlation analysis represents the correlation length [12,15], not the size of the texture. Consequently, the fractal characteristic length in variation-correlation analysis is a more suitable parameter in quantitative fractography because its estimated value is able to represent the maximum size $d_{max}$ of the basic micro-morphology elements. Consider the simple case of a brittle intergranular fractography that consists of uniform morphology elements with maximum diameter $d_{max}$. In variation-correlation analysis, when $d_{max}/2 < \varepsilon < d_{max}$, some estimated values of variation in the statistical measurement have already reached the value $V_{max}$. When $\varepsilon > d_{max}$, all the estimated values of variation are $V_{max}$. Therefore, when $\varepsilon > d_{max}/2$, the linearity of the log-log plot deteriorates, so that the estimated value of $\varepsilon_{max}$ falls into $[d_{max}/2, d_{max}]$, while its fluctuation depends on the given value of $M_F$. The results in Table 2 hence verify the validity of the analysis. Therefore, the fractal characteristic length can be employed as a statistical measure of the maximum size of texture of fracture surfaces.
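The claim that the estimated characteristic length falls between half the element size and the full element size can be checked directly against the variation-correlation columns of Table 2. The short sketch below does this arithmetic; the dictionary keys and the helper name are made up for the example, and the intergranular value of 26 is the reading assumed here for a table cell that is not fully legible.

```python
# (epsilon_max from variation-correlation analysis, d_max) as listed in Table 2.
# The intergranular epsilon_max of 26 is an assumed reading of the table cell.
table2 = {
    "microvoid": (64, 75),
    "transgranular": (17, 31),
    "intergranular": (26, 35),
}


def within_half_to_full(eps_max, d_max):
    """True if d_max / 2 <= eps_max <= d_max."""
    return d_max / 2 <= eps_max <= d_max


checks = {mode: within_half_to_full(eps, d) for mode, (eps, d) in table2.items()}
for mode, ok in checks.items():
    eps, d = table2[mode]
    print(f"{mode}: {d / 2} <= {eps} <= {d} is {ok}")
```

All three fracture modes satisfy the inequality with these values, which is what the argument above relies on.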


4.2 Relationship between fractal dimension and mechanical properties

Fig. 2 shows the SEM fractography and the multi-magnification fractal curves for the gray iron. Specimen A has been modified by FeSiCuMo metal stream inoculation. Thus, the microstructures of the matrix and the graphite structures were improved, leading to an increase in tensile strength $\sigma_b$ ($\sigma_b$ = 314 MPa for sample A and 255 MPa for sample B). The elongation of gray iron is almost zero. The size of graphite and graphite-eutectic cells is a very important geometric factor. Small grains lead to multi-scale fractal structures of the material, and the fracture surface becomes more complicated. The cleavage facets of specimen A, whose size is associated with grain size, are smaller than those of specimen B, leading to the more complicated fracture surface shown in Fig. 2 (a). This implies that the fractal dimension of specimen A is larger than that of specimen B. Fig. 3 is the curve of fractal dimension versus SEM magnification, which verifies this analysis. Fig. 3 shows that the fractography cannot be described by a single fractal dimension but rather by a series of fractal dimensions. It also reveals that the fractography is not ideally self-affine at all scales of SEM. The transgranular cleavage fracture, for example, shows cleavage facets as the dominant morphology at low magnification and river patterns as the dominant morphology at high magnification, implying that the fractal dimension for a given mode is not universal.

Fig. 2. SEM fractography of samples A and B of gray iron. (a) SEM fractography of sample A; (b) SEM fractography of sample B. The resolution of the images is 421 µm/160 pixels.


Fig. 3. Fractal dimension $D_{cor}$ versus magnification $M$ curves of samples A and B.


Fig. 4 shows the austenitic dendrite structures of HP26Cr35Ni alloy. Because the specimens were taken from the same region of solidification, the distribution and size of austenitic dendrites are the same in all the specimens even though they have been subjected to different thermal shocks. Their impact toughness values, however, differ from each other owing to the formation of micro-cracks during the thermal shock treatment. Fractal measurements in Fig. 4 (d) show that there is no significant difference in the fractal dimensions of the specimens under different thermal shocks. Therefore, there is no direct correlation between fractal dimension and toughness when the change in toughness is caused by non-geometric factors.

The present experiment has shown that the fractal dimension of fractography at SEM scales is not a single value but rather a series of values for a given fracture mode. This is because of the different dominant patterns of fractography in different ranges of magnification. It can be deduced that when the magnification is high enough, the fractal dimension will tend to 2 because the micro-morphology elements approximately become 2D surfaces. The irregular geometric structures of fracture surfaces can reflect the state of the energy absorbed by the fracture surface due to the formation of new surfaces. If the change in toughness results only from the irregularity of the fracture surface, fractal analysis can be employed.

Fig. 4. Microstructures, impact toughness and fractal dimension of HP26Cr35Ni alloy specimens subjected to different numbers of thermal shocks. (a) The original microstructure; (b) microstructure after 10 thermal shocks; (c) microstructure after 20 thermal shocks; (d) changes in impact toughness and fractal dimension with the number of thermal shocks.


5  Conclusions 


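The variation-correlation of fracture surfaces imaged by SEM has been found to follow a power law only within a limited range of scales $\varepsilon < \varepsilon_{max}$. Beyond this range, the variation-correlation tends to a constant. The scaling properties can be expressed by

$$V_{cor}(\varepsilon) \propto \begin{cases} \varepsilon^{2\alpha}, & \varepsilon \ll \varepsilon_{max} \\ \mathrm{const}, & \varepsilon \gg \varepsilon_{max} \end{cases}$$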


Fractal  dimension  of  fractography  at  SEM  scales  is  not  a  single  value  but  rather  a 
series  of  values  for  a  given  fracture  mode.  There  exists  a  positive  correlation  between 
fractal  dimension  and  toughness  of  the  materials  when  the  change  in  toughness  is  caused 
by  geometric  factors.  However,  if  the  change  in  toughness  is  caused  by  non-geometric 
factors  such  as  micro-cracks,  there  is  no  essential  relationship  between  impact  toughness 
and  fractal  dimension.  For  a  given  fracture  mode,  fractal  dimension  is  not  universal. 

The process of coarsening by curvature driven interface motion of fractal interfaces in two-dimensional
space  is  studied  by  analytical  and  numerical  methods.  A  statistical  model  is  presented,  which  allows  an 
analytical  treatment  of  the  main  features  of  coarsening  of  a  fractal  interface.  For  non-conserved  motion  the 
interface  is  described  by  a  statistical  distribution  function  of  size  scales,  which  obeys  a  continuity  equation 
in  size  space.  The  solution  of  the  continuity  equation  yields,  for  a  self-similar  initial  distribution  function, 
the  time  development  of  the  interface  in  terms  of  a  time  dependent  size  distribution  function,  which  exhibits 
a  growing  lower  characteristic  length  scale  and  leads  to  a  power-law  decay  of  the  total  interface  length.  The 
effect  of  coarsening  on  the  scale  of  observation  is  discussed. 


1  Introduction 

Coarsening is a long-time relaxation phenomenon that is connected with an increase in the characteristic size scale of an unstable two- or multiphase microstructure. The driving force for this scale coarsening is the reduction of the excess free energy associated with the interface between different phases. The interface density $S_V = S/V$, i.e. the interface area $S$ per unit volume $V$, decreases with time, and therefore the characteristic size scale of the interface system, given by $l = 1/S_V$, increases: the microstructure coarsens. For sufficiently long times the decrease in the interface density obeys the asymptotic power law $S_V \propto t^{-\alpha}$, where for sufficiently smooth interfaces the coarsening exponent $\alpha$ depends only on the transport mechanism that drives the interface motion. For curvature driven interface motion (interface limited coarsening), as is the case for antiphase domain coarsening [1] and normal grain growth [2, 3], the kinetic exponent has the universal value $\alpha = 1/2$ (parabolic growth law). In the case of diffusion limited coarsening the exponent is $\alpha = 1/3$ (Ostwald ripening [2, 4]). For interface limited coarsening the local order-parameter of the phase-ordering dynamics associated with the interface motion is non-conserved, while for diffusion limited coarsening the local order-parameter is conserved [5].

Recently there has been a great deal of interest in the coarsening kinetics of fractal structures. While the generation of fractal patterns far from equilibrium has been widely studied (cf. e.g. [6]), much less work has been done on the reverse process, the decay of a fractal structure. In complex nonequilibrium systems such as fractal interfaces and clusters the dynamic power law is characterized by a non-trivial coarsening exponent. For the non-conserved curvature driven interface motion it has been shown theoretically as well as experimentally [7 to 10] that initially fractal interfaces exhibit a power law decay with a coarsening exponent $\alpha$ that depends on the fractal dimension. Although similar results have been found, by computer simulation, for the conserved coarsening dynamics of fractal clusters [11 to 13], a complete theoretical description is still lacking for this case [13]. However, in the case of non-conserved fractal interface coarsening [7 to 10] no exact solution is available either, because the equation of motion of complex interfaces is highly nonlinear and can, in general, be solved only numerically. Therefore, Toyoki and Honda [7] used a scaling hypothesis to find the asymptotic power law for the coarsening of a non-random self-similar interface. They found for the coarsening exponent the relation $\alpha = (D + 1 - d)/2$, where $D$ is the fractal dimension of the interface and $d$ the Euclidean dimension of the space. The same result was derived by Orihara and Ishibashi [8], however for a random fractal interface and without assuming self-similar evolution, using a diffusion equation for the description of the interface motion. We could confirm the decay law by experimental observation [9, 10] and by a Monte Carlo simulation of the smoothing kinetics of initially fractal two-dimensional grain boundaries [10].

In  the  present  paper  a  quite  different  approach  is  used  to  describe  the  coarsening 
process  of  a  random  self-similar  interface.  The  random  interface  is  described  by  a 
statistical  distribution  function  of  size  scales  or  curvature  radii,  which  obeys  a  continuity 
equation  in  size  or  curvature  space.  The  solution  of  the  continuity  equation  yields  for  non- 
conserved  motion  and  an  initial  fractal  distribution  function  the  time  development  of  the 
interface  in  terms  of  a  time  dependent  size  distribution  function,  which  exhibits  a  growing 
lower  characteristic  length  scale.  Besides  the  description  of  the  main  features  of  the 
coarsening  process  this  approach  is  also  of  physical  interest  because  it  is  an  example  of 
how  a  cutoff  of  a  statistical  fractal  may  emerge  in  a  complex  system. 

The paper is organized as follows. In order to give an illustrative insight into the coarsening process under consideration, in Section 2 the dynamic scaling hypothesis and its confirmation (within the range of a lower and upper cutoff) by a Monte Carlo simulation are briefly discussed. In Section 3 the statistical model is developed, which allows, for a simple initially random self-similar interface system, an analytical solution in terms of a time dependent size distribution function. A summary concludes the paper.


2  Scaling  Hypothesis  and  Monte  Carlo  Simulation 

In general it is not possible to solve the equations of motion of the interface analytically, owing to the complexity of fractal interfaces in conjunction with the high nonlinearity of the equations of interface motion [7, 8]. Therefore the scaling hypothesis, according to which the interface remains self-similar during temporal evolution, may serve as a simple coarsening scenario [7, 9, 10, 11]. If we restrict ourselves to a plane section of an otherwise three-dimensional interface system, the interface density is proportional to the total length $L$ of the boundary line measured on the plane section of area $A$ [9, 10]. For a non-random fractal boundary with a fractal dimension $D_s$ ($1 < D_s < 2$), the boundary length obeys the scaling relation

$$L \propto R_m^{-(D_s-1)} \qquad (1)$$

where $R_m$ is the characteristic length of the structural unit of the boundary corresponding to the lower cutoff length of the fractal. The main effect of coarsening arises from the growth of the lower cutoff due to the time law

$$R_m \propto t^{1/(\beta+1)} \qquad (2)$$

where $\beta = 1$ for interface limited and $\beta = 2$ for diffusion limited growth. If we assume that the scaling relation Eq. (1) remains valid also during the relaxation process, the boundary length and therewith the interface density decreases as

$$L \propto t^{-(D_s-1)/(\beta+1)} . \qquad (3)$$

For the case of non-conserved interface motion ($\beta = 1$) we have tested the decay law Eq. (3) by performing a Monte Carlo simulation of the thermal relaxation of initially fractal two-dimensional grains [10].

Using the Monte Carlo procedure described by Anderson et al. [14] for modelling
normal  grain  growth,  a  single  closed  grain  is  mapped  onto  a  discrete  square  lattice  of 
512  x  512  lattice  sites,  where  each  lattice  site  is  assigned  the  orientation  number  1  or  2. 
The  interface  is  defined  to  lie  between  two  sites  of  unlike  orientations.  Taking  only  an 
interaction  between  nearest  neighbours  into  account,  the  interface  motion  is  simulated  by 
employing  a  standard  Monte  Carlo  technique  in  conjunction  with  an  energy  criterion  for 
successful  transition  to  the  other  orientation. 
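To make the simulation procedure more concrete, the following is a minimal sketch of a two-orientation lattice Monte Carlo step with a nearest-neighbour energy criterion. It is not the procedure of [14] in detail: the zero-temperature acceptance rule (accept a re-orientation only if the number of unlike neighbour pairs does not increase), the 32 x 32 lattice, the free borders and the spiky square starting grain are all assumptions chosen to keep the example small.

```python
import random


def boundary_length(lattice):
    """Count nearest-neighbour pairs of unlike orientation (free borders)."""
    n = len(lattice)
    length = 0
    for i in range(n):
        for j in range(n):
            if i + 1 < n and lattice[i][j] != lattice[i + 1][j]:
                length += 1
            if j + 1 < n and lattice[i][j] != lattice[i][j + 1]:
                length += 1
    return length


def unlike_neighbours(lattice, i, j, value):
    """Number of the four nearest neighbours of (i, j) that differ from `value`."""
    n = len(lattice)
    count = 0
    for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        a, b = i + di, j + dj
        if 0 <= a < n and 0 <= b < n and lattice[a][b] != value:
            count += 1
    return count


def monte_carlo_step(lattice, rng):
    """One Monte Carlo step: n*n attempted re-orientations of random sites."""
    n = len(lattice)
    for _ in range(n * n):
        i, j = rng.randrange(n), rng.randrange(n)
        old = lattice[i][j]
        new = 3 - old  # switch orientation 1 <-> 2
        d_energy = (unlike_neighbours(lattice, i, j, new)
                    - unlike_neighbours(lattice, i, j, old))
        if d_energy <= 0:  # assumed zero-temperature energy criterion
            lattice[i][j] = new


def make_spiky_grain(n=32):
    """Hypothetical start: a 16 x 16 grain with single-site protrusions."""
    lattice = [[1] * n for _ in range(n)]
    for i in range(8, 24):
        for j in range(8, 24):
            lattice[i][j] = 2
    for j in range(9, 23, 2):
        lattice[7][j] = 2
        lattice[24][j] = 2
    return lattice


rng = random.Random(0)
lattice = make_spiky_grain()
lengths = [boundary_length(lattice)]
for _ in range(30):
    monte_carlo_step(lattice, rng)
    lengths.append(boundary_length(lattice))
print(lengths[0], lengths[-1])
```

With this acceptance rule the boundary length can never increase from one step to the next, so the recorded `lengths` form a non-increasing sequence; a finite-temperature rule would allow occasional increases.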


Figure 1: Simulation of the coarsening and shrinking process of an initially non-random fractal ($D_s = 1.5$). The numbers (180, 760, 2000) represent the time in Monte Carlo steps.

Figure 2: Same as for Fig. 1 but for an initially random shaped interface ($D_s = 1.22$) from a metallographic section of a deformed Zn specimen.


Figure 1 shows the simulation of the relaxation (and shrinking) process of an initially non-random fractal interface, the so-called square Koch island ($D_s = 1.5$). The picture shows that the simulation clearly confirms, within a certain range between the lower and upper cutoff, the above scaling assumption that the boundary smooths out step by step from the higher to about the next lower generation of the initially self-similar fractal pattern. A similar simulation is shown in Fig. 2, starting with an irregularly shaped interface with an initial fractal dimension $D_s = 1.22$, which corresponds to a closed grain boundary from a metallographic section of a deformed Zn specimen (for details see [9, 10]).


Figure 3: Plot of ln(L) versus ln(t) of the Monte Carlo simulation for three different fractal interfaces. The full lines represent the least-squares fit to Eq. (3) ($\beta = 1$) (cf. Table I). $L$ is normalized to the boundary length at $t = 0$.


The log-log plot of the boundary length $L$ (perimeter) of the grains in Figs. 1 and 2 versus simulation time $t$ (measured in Monte Carlo steps) in Fig. 3 shows that the coarsening kinetics does in fact follow, over a wide range, the time law Eq. (3) for $\beta = 1$. The fractal dimension derived from the coarsening kinetics, i.e. from the slopes of the full lines in Fig. 3 via the time exponent $(1 - D_s)/2$ in Eq. (3) (for $\beta = 1$), agrees well with the fractal dimension determined by the geometry of the initial interface (Table I). For the random interface (Fig. 2), as time elapses, the crossover from self-similar boundary coarsening to normal grain shrinking kinetics governed by the parabolic time law $L_0^2 - L^2 \propto t$ can be observed in Fig. 3 [10].
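The step from a fitted slope to a fractal dimension can be written out explicitly. The sketch below fits a straight line to ln(L) versus ln(t) by ordinary least squares and inverts the time exponent of Eq. (3), $-(D_s - 1)/(\beta + 1)$, for $D_s$. The function names and the synthetic data are assumptions for illustration; the data are generated from the power law itself and are not the simulation results of Fig. 3.

```python
import math


def fit_slope(xs, ys):
    """Ordinary least-squares slope of ys against xs."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx


def fractal_dimension_from_kinetics(times, lengths, beta=1):
    """Invert the exponent of Eq. (3): slope = -(Ds - 1) / (beta + 1)."""
    slope = fit_slope([math.log(t) for t in times],
                      [math.log(length) for length in lengths])
    return 1 - (beta + 1) * slope


# Synthetic boundary lengths following Eq. (3) exactly (illustration only).
true_ds = 1.5
times = [10 * 2 ** k for k in range(8)]
lengths = [t ** (-(true_ds - 1) / 2) for t in times]

estimated_ds = fractal_dimension_from_kinetics(times, lengths)
print(round(estimated_ds, 6))
```

On real simulation data the fit would have to be restricted to the time window between the lower and upper cutoff, since the power law only holds there.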

In the following section a statistical coarsening model for a much simpler interface configuration is presented, which nevertheless allows an analytical treatment, beyond the above scaling hypothesis, of all the main features of self-similar interface coarsening.

Table I: Fractal dimension of self-similar interfaces from the coarsening kinetics

| Interface | $D_s$ (from the fractal geometry of the initial interface) | $D_s$ (from the coarsening kinetics, Fig. 3 and Eq. (3)) |
| Frak 150: square Koch island | 1.50 | 1.54 |
| Frak 121: modified Koch island | 1.21 | 1.20 |
| Zink: random interface | 1.22 (yardstick method) | 1.25 |

3  Statistical  Model  of  Non-Conserved  Coarsening  of  a  Self-Similar  Interface 

In the following considerations we replace the complex fractal boundaries of the preceding section by the somewhat simpler system of a two-dimensional random assembly of non-overlapping and non-interacting closed circular loops of different curvature radii $R$. For a non-conserved system the equation of motion for a single circular interface with radius $R$ is given by

$$\dot{R} = -kK^{\beta} = -\frac{k}{R^{\beta}} , \qquad (4)$$

where $\dot{R}$ is the radial velocity, $K$ the curvature and $k$ the mobility of the interface. Again $\beta = 1$ for interface limited growth and $\beta = 2$ for diffusion limited growth. For the description of the time development of the total interface, i.e. of the whole assembly of circular loops, we introduce the size distribution function $F(R,t)$, where $F(R,t)\,dR$ gives the number of loops per unit area with radius between $R$ and $R + dR$. For a random fractal interface the initial size distribution function is assumed to have the hyperbolic form

$$F(R,0) = F_0(R) = BR^{-D_s-1} , \qquad (5)$$

where $D_s$ is the fractal dimension of the whole assembly and $B$ a constant. Since during coarsening no new loop is created and the only process of annihilation of interfaces is the shrinking of loops according to the growth law Eq. (4), the size distribution function must obey a continuity equation in size space given by

$$\frac{\partial F(R,t)}{\partial t} + \frac{\partial}{\partial R}\left[\dot{R}\,F(R,t)\right] = 0 . \qquad (6)$$

For a given initial distribution $F(R,0) = F_0(R)$ the solution of the first-order partial differential equation (6) can be written in the form [15]

$$F(R,t) = F_0(R_0(R,t))\,\frac{\partial R_0}{\partial R} , \qquad (7)$$

where $R_0 = R_0(R,t)$ is the solution of the characteristic equation (4) with the integration constant $R_0 = R(t = 0)$. The integration of Eq. (4) yields

$$R_0 = \left(R^{\beta+1} + (\beta+1)kt\right)^{1/(\beta+1)} . \qquad (8)$$

Equations (5) and (8) inserted into Eq. (7) lead to the size distribution function

$$F(R,t) = BR^{\beta}\left[R^{\beta+1} + (\beta+1)kt\right]^{-(\beta+D_s+1)/(\beta+1)} . \qquad (9)$$

At $t = 0$ Eq. (9) reduces to the initial distribution Eq. (5). At $t > 0$, in contrast to the initial distribution function, $F(R,t)$ exhibits a maximum and tends towards zero for vanishing radius $R$ (Fig. 4). The location of the maximum defines a natural characteristic length scale or lower "cutoff" $R_m = R_m(t)$ where scaling breaks down:

$$R_m(t) = \left(\frac{\beta(\beta+1)kt}{D_s+1}\right)^{1/(\beta+1)} . \qquad (10)$$

Figure 4: Size distribution function Eq. (9) for $\beta = 1$ and $D_s = 1.5$ at different times $t$. Dot-dashed line: $t = 0$ (initial distribution Eq. (5)), full line: $t_1 > 0$, dotted line: $t_2 = 2t_1$, dashed line: $t_3 = 6t_1$. $R$ and $t$ are given in terms of arbitrary unit length and unit time, respectively.

In the following discussion we restrict ourselves to interface limited coarsening ($\beta = 1$), where $R_m$ increases with time as

$$R_m(t) = \left(\frac{2kt}{D_s+1}\right)^{1/2} . \qquad (11)$$
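The position of the maximum can be cross-checked numerically. The sketch below evaluates the size distribution in the form $F = BR^{\beta}[R^{\beta+1} + (\beta+1)kt]^{-(\beta+D_s+1)/(\beta+1)}$, which is the reading of Eq. (9) assumed here, locates its maximum on a fine grid, and compares the result with the closed form obtained by setting $\partial F/\partial R = 0$. The function names, the grid search and the parameter values ($k = B = 1$, $t = 1$, $D_s = 1.5$) are choices made for this example only.

```python
def size_distribution(R, t, Ds, beta=1, k=1.0, B=1.0):
    """Size distribution F(R, t) in the form assumed for Eq. (9)."""
    base = R ** (beta + 1) + (beta + 1) * k * t
    return B * R ** beta * base ** (-(beta + Ds + 1) / (beta + 1))


def cutoff_radius(t, Ds, beta=1, k=1.0):
    """Closed-form location of the maximum of F, from dF/dR = 0."""
    return (beta * (beta + 1) * k * t / (Ds + 1)) ** (1 / (beta + 1))


def argmax_on_grid(t, Ds, beta=1, k=1.0, n=20000, r_max=10.0):
    """Brute-force search for the maximum of F on a uniform grid in R."""
    best_R, best_F = 0.0, -1.0
    for i in range(1, n + 1):
        R = r_max * i / n
        F = size_distribution(R, t, Ds, beta, k)
        if F > best_F:
            best_R, best_F = R, F
    return best_R


for beta in (1, 2):
    numeric = argmax_on_grid(1.0, 1.5, beta=beta)
    analytic = cutoff_radius(1.0, 1.5, beta=beta)
    print(beta, round(numeric, 4), round(analytic, 4))
```

For $\beta = 1$ the closed form reduces to $(2kt/(D_s+1))^{1/2}$, so the cutoff grows with the square root of time.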


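The total number of loops decays as

$$N(t) = \int_0^{\infty} F(R,t)\,dR = \frac{B}{D_s}(2kt)^{-D_s/2} , \qquad (12)$$

which is associated with a decrease in the total length of the interface

$$L(t) = 2\pi\int_0^{\infty} R\,F(R,t)\,dR = 2\pi B\,g(0,D_s)\,(2kt)^{-(D_s-1)/2} . \qquad (13)$$

In this relation $g(0,D_s)$ is defined by the integral

$$g(x,D_s) = \int_x^{\infty} \xi^2\left(\xi^2+1\right)^{-(D_s+2)/2}\,d\xi . \qquad (14)$$

For the average radius we obtain

$$\langle R \rangle = \frac{L(t)}{2\pi N(t)} = D_s\,g(0,D_s)\,(2kt)^{1/2} , \qquad (15)$$

i.e. $\langle R \rangle \propto R_m(t)$.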
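The closed forms for the number of loops and the average radius can be checked by direct integration for $\beta = 1$. The following is a sketch of the intermediate steps; the substitution variable $u$ is introduced here for convenience and is not part of the original derivation, and $\langle R \rangle$ is taken to be the number-weighted mean radius.

```latex
\begin{align*}
F(R,t) &= B\,R\,\bigl(R^{2}+2kt\bigr)^{-(D_s+2)/2},
  \qquad u = R^{2}+2kt,\quad du = 2R\,dR, \\
N(t) &= \int_{0}^{\infty} F(R,t)\,dR
  = \frac{B}{2}\int_{2kt}^{\infty} u^{-(D_s+2)/2}\,du
  = \frac{B}{D_s}\,(2kt)^{-D_s/2}, \\
\int_{0}^{\infty} R\,F(R,t)\,dR
  &= B\,(2kt)^{-(D_s-1)/2}\int_{0}^{\infty}\xi^{2}\bigl(\xi^{2}+1\bigr)^{-(D_s+2)/2}\,d\xi
  = B\,g(0,D_s)\,(2kt)^{-(D_s-1)/2},
  \qquad R = (2kt)^{1/2}\,\xi, \\
\langle R\rangle &= \frac{1}{N(t)}\int_{0}^{\infty} R\,F(R,t)\,dR
  = D_s\,g(0,D_s)\,(2kt)^{1/2}.
\end{align*}
```

Both $\langle R \rangle$ and $R_m$ therefore grow as $t^{1/2}$, and their ratio $D_s\,g(0,D_s)\,(D_s+1)^{1/2}$ depends only on the fractal dimension.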

The power-law behaviour of the interface length, Eq. (13), results in the present approach from the initial fractal distribution function in conjunction with the growth law Eq. (4) and the continuity equation (6), without assuming the scaling hypothesis Eq. (1) during the temporal evolution. While for a deterministic fractal the lower cutoff length is simply defined by the lower limit of the geometric building block, for the present statistical fractal the lower characteristic length scale emerges from the physics of the coarsening process. The emergence of empirical random fractals limited by a lower and upper cutoff has also been discussed from a different point of view in [16].

The present model allows the effect of coarsening on the resolution of the interface length by measurement to be studied in a simple but instructive way. If we assume that for a certain measuring device the resolution of the length scale is restricted to radii $R' > R$, the number of loops that can be observed by that device is given by

$$N(R,t) = \int_R^{\infty} F(R',t)\,dR' = \frac{B}{D_s}\left(R^2+2kt\right)^{-D_s/2} . \qquad (16)$$

The total interface length that can be measured is

$$L(R,t) = 2\pi\int_R^{\infty} R'F(R',t)\,dR' = 2\pi B\,(2kt)^{-(D_s-1)/2}\,g\!\left(R/(2kt)^{1/2}, D_s\right) , \qquad (17)$$

where the function $g(x,D_s)$ is defined by the integral Eq. (14).

Fig. 5 shows, in the form of a log-log plot, the observable interface length Eq. (17) versus resolution, that is, in dependence on the segments of scale $R$ that are needed to approximate the interface length (Richardson plot), at different times.

Figure 5: Log-log plot of the interface length Eq. (17) versus R at different times as in Fig. 4.

The coarsening, i.e. the growth of the lower characteristic length, leads to a reduction in the available scaling range at smaller $R$. The width of the crossover region from the Euclidean behaviour at small $R$ to the fractal behaviour at large $R$ becomes broader with increasing time. This behaviour can be characterized quantitatively by the slope of the curves in Fig. 5 or, alternatively, by the effective fractal dimension [17]

$$D_{eff} = 1 - \frac{d\ln L(R,t)}{d\ln R} , \qquad (18)$$

which depends on the scale of observation $R$. $D_{eff} = D_{eff}(R,t)$ shows a crossover from $D_{eff} = 1$ at small $R$ (high resolution) to $D_{eff} = D_s$ at large $R$ (low resolution) (Fig. 6). For later times, corresponding to a more relaxed interface, $D_{eff}$ approaches $D_s$ at larger $R$. This may have an effect on the empirical fractal dimension of the interface, which is obtained by applying an empirical observation method, such as the yardstick or box counting method, at a fixed coarsening state.
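The crossover of the effective dimension can be reproduced with a few lines of numerics. The sketch below assumes the form $D_{eff} = 1 - d\ln L/d\ln R$ for Eq. (18) and $L(R,t) \propto g(x, D_s)$ with $x = R/(2kt)^{1/2}$ for Eq. (17), which gives $D_{eff} = 1 + x^3(x^2+1)^{-(D_s+2)/2}/g(x,D_s)$. The midpoint rule in $\ln\xi$, the analytic power-law tail beyond the upper integration limit, and all function names are choices made for this example.

```python
import math


def g(x, Ds, n=20000):
    """Integral of xi^2 (xi^2 + 1)^(-(Ds + 2) / 2) from x to infinity.

    Midpoint rule in s = ln(xi) up to a large upper limit, plus the leading
    power-law tail xi^(-Ds) integrated analytically beyond that limit.
    """
    lower = max(x, 1.0e-8)  # the piece below 1e-8 is negligible
    upper = max(1.0e4, 10.0 * lower)
    log_lower = math.log(lower)
    h = (math.log(upper) - log_lower) / n
    total = 0.0
    for i in range(n):
        xi = math.exp(log_lower + (i + 0.5) * h)
        # d(xi) = xi * ds, hence the extra factor of xi
        total += xi ** 3 * (xi * xi + 1.0) ** (-(Ds + 2.0) / 2.0) * h
    tail = upper ** (1.0 - Ds) / (Ds - 1.0)
    return total + tail


def effective_dimension(R, t, Ds, k=1.0):
    """D_eff = 1 - d ln L / d ln R for L(R, t) proportional to g(x, Ds)."""
    x = R / math.sqrt(2.0 * k * t)
    return 1.0 + x ** 3 * (x * x + 1.0) ** (-(Ds + 2.0) / 2.0) / g(x, Ds)


Ds = 1.5
for R in (1.0e-3, 1.0, 1.0e3):
    print(R, round(effective_dimension(R, 0.5, Ds), 4))
```

With $2kt = 1$ the printed values move from close to 1 at small $R$ towards $D_s$ at large $R$; increasing $t$ shifts the same curve to larger $R$, since $D_{eff}$ depends on $R$ and $t$ only through $x$.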


Figure 6: Scale dependent effective fractal dimension Eq. (18) at different times as in Fig. 4.


If we consider $D_{eff}$ as an estimate for the empirical fractal dimension, and if we take into account that for a sufficiently coarsened interface $D_{eff}$ may remain well below the asymptotic value $D_s$ in Fig. 6 because the usable scale of $R$ is limited by the existence of an upper cutoff defined by the sample size (e.g. by the grain size in Fig. 1), it follows that the empirical fractal dimension decreases during coarsening. Hence, for each coarsening state of the grain in Fig. 1, for example, a somewhat lower empirical fractal dimension than for the previous state might be found. In contrast, the corresponding coarsening kinetics, as represented by the slope of the straight lines in Fig. 3, is uniquely determined by the "initial" fractal dimension $D_s$ that is inherent to the initial self-similar interface or, as in the present statistical model, to the initial distribution function Eq. (5).

Finally,  it  is  worth  noting  that  the  functional  form  of  the  fractal  equation  (17)  is  quite 
different  from  Rigaut’s  empirical  interpolation  Ansatz  [18]  for  concave  log-log  plots. 
Rigaut’s  interpolation  formula  has  the  form 

L  =  c^(l  +  c2RD^'y] ,  (19) 

which, in a log-log plot such as Fig. 5, tends towards the horizontal maximum at high resolution
and towards the ideal fractal at low resolution. From the limiting behaviour of Eq. (17) for
small and large R one finds $c_1 = L(t)$ and $c_2 = (D_s - 1) L(t) / (2\pi B)$, respectively,
where L(t) is given by Eq. (13). A quantitative comparison shows that this interpolation
formula is only a poor approximation to the exact fractal equation (17) in the
crossover region.
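
The two limits of the interpolation formula can be checked numerically. The following is a minimal sketch, assuming that the effective dimension is the local log-log slope, D_eff = 1 - d ln L / d ln R; this definition, the function names and all parameter values are illustrative assumptions and are not taken from Eqs. (17) or (18).

```python
import numpy as np

def rigaut_length(R, c1, c2, Ds):
    """Rigaut-type interpolation L = c1 / (1 + c2 * R**(Ds - 1))."""
    return c1 / (1.0 + c2 * R ** (Ds - 1.0))

def effective_dimension(R, c2, Ds):
    """Local slope D_eff = 1 - dlnL/dlnR of the formula above (analytic)."""
    x = c2 * R ** (Ds - 1.0)
    return 1.0 + (Ds - 1.0) * x / (1.0 + x)

# Illustrative (assumed) parameter values.
Ds, c1, c2 = 1.5, 10.0, 2.0
R = np.logspace(-6, 6, 7)
L = rigaut_length(R, c1, c2, Ds)
D_eff = effective_dimension(R, c2, Ds)
# D_eff rises from 1 (smooth line, small R) towards Ds (ideal fractal, large R).
```

In this sketch the crossover scale is set by c2 alone, which is why a two-parameter formula of this type cannot follow an arbitrary crossover shape.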


4  Summary 

After a discussion of the dynamic scaling behaviour of self-similar interfaces, as it follows
from analytical scaling assumptions and numerical Monte Carlo simulations, a statistical
model was presented which shows all the main features of coarsening of fractal interfaces.
The  coarsening  kinetics  of  a  two-dimensional  non-conserved  interface  system  consisting  of 
a  random  assembly  of  non-interacting  circular  loops  could  be  described  by  a  size 
distribution  function  obeying  a  continuity  equation  in  size  space.  For  a  self-similar  initial 
distribution  the  solution  of  the  continuity  equation  yielded  a  time  dependent  size 
distribution function, which exhibits a lower characteristic length scale where scaling
breaks  down.  Physically  this  emerges  from  the  curvature  driven  relaxational  motion  of  the 
interfaces  leading  to  parabolic  growth  of  the  characteristic  length  scale.  The  total  interface 
length  shows  the  characteristic  power-law  decay,  where  the  coarsening  exponent  depends 
on  the  inherent  fractal  dimension  of  the  initial  interface.  The  model  allowed  us  to  calculate 
the  interface  length  as  a  function  of  resolution,  which  led  us  to  a  discussion  of  the  effect  of 
coarsening  on  the  scale  of  observation. 

The transport of surface pollutants in the coastal zone is modelled using a modified particle tracking
diffusion  model.  The  new  model  uses  fractional  Brownian  motion  (fBm)  functions  to  produce 
superdiffusive  spreading  of  the  synthesised  pollutant  clouds.  The  model  is  tested  on  a  numerical 
model  of  a  coastal  bay  recirculation  zone. 


1  Introduction 

The  spread  of  pollutants  in  the  environment  is  a  topic  of  much  research.  Many 
environmental  pollutants  are  transported  via  turbulent  fluids  (in  rivers,  the  oceans  and  the 
atmosphere).  The  overall  spreading  of  the  pollutants  in  these  turbulent  flowfields  is 
generally  non-Fickian  (in  fact  superdiffusive)  over  some,  if  not  all,  length  scales  under 
consideration.  Traditional  methods  of  modelling  diffusive  processes  in  the  environment 
rely  on  either  solving  an  advection-diffusion  equation  on  a  computational  grid  or  using  a 
particle-tracking technique1-3. Both methods lead to solutions which are Fickian in nature.
The spatial correlations which exist over significantly large scales result in Lagrangian
memory effects within the flow field. Recently the authors have developed a particle-tracking
technique that produces non-Fickian diffusion4-6. The technique works by
incorporating fractional Brownian motion (fBm) trajectories within the particle tracking
model. The work is motivated by the recent discovery that ocean surface drifter
trajectories are fractal in nature7-9. In this paper previous work is expanded and the
simulation  of  coastal  diffusion  in  a  numerically  generated  bay  flow  model  is  conducted. 


2  The  Non-Fickian  Particle  Tracking  Model 

Particle  tracking  models  work  by  releasing  a  large  number  of  massless  marked  particles 
into a known flow-field. These are then diffused according to a Fickian diffusion model and
advected according to the spatial distribution of the flow-field. Each particle represents a
portion  of  the  mass  of  the  contaminant,  and  the  ensemble  particle  cloud  can  be  converted 
into  a  spatial  concentration  distribution.  The  authors  generate  non-Fickian  diffusive 
behaviour  of  particle  clouds  within  a  particle  tracking  diffusion  model  by  specifying  each 
particle  path  as  a  fractional  Brownian  motion  (fBm).  A  generalisation  of  Brownian 
motion,  fBm  is  a  random  fractal  function  defined  by  Mandelbrot  and  Van  Ness  which 
exhibits long term correlation over all scales. In this section we briefly give the algorithm
for the generation of fBm; more specific details of the model are given by the authors
elsewhere, see for example11-13. Fractional Brownian motion is defined as


$B_H(t) = \frac{1}{\Gamma(H + 1/2)} \int_{-\infty}^{t} (t - t')^{H - 1/2} R(t') \, dt'$ ,  (1)


where $\Gamma$ is the gamma function and R(t) is a continuous white noise function. A discrete
approximation  to  Eq.  (1)  may  be  generated  in  two  steps:  Firstly,  incremental  steps  in  the 
fBm  walk  are  calculated  using 


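$B_H(t_i) - B_H(t_{i-1}) = \frac{1}{\Gamma(H + 1/2)} \left[ \sum_{j=i-M}^{i-2} \left( (i-j)^{H-1/2} - (i-j-1)^{H-1/2} \right) R(t_j) + R(t_{i-1}) \right]$ ,  (2a)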


where M is a finite memory; and $R(t_j)$ are discrete random numbers with a Gaussian
distribution of known standard deviation and zero mean. (Note that simpler distributions
may be used.) These incremental steps are known as fractional Gaussian noise (fGn), a
generalisation of white noise. The fGn is then summed to generate fBm at discrete times
$t_i$ ($= \Delta t \times i$):


$B_H(t_i) = \sum_{j=1}^{i} \left[ B_H(t_j) - B_H(t_{j-1}) \right]$ .  (2b)


Eqs.  (2a)  and  (2b)  may  then  be  used  to  specify  the  displacement-time  behaviour  of 
individual  particles  in  each  spatial  dimension  of  the  diffusive  problem  under  investigation. 
This  requires  an  independent  realisation  of  the  fBm  for  each  spatial  co-ordinate.  Carrying 
this out for each particle in a diffusing cloud results in scaling of the standard deviation of the
cloud, $\sigma_c$, of the form


$\sigma_c = (2 K_f t)^H$  (3)


(here we do not include advection), where $K_f$ is the fractal (anomalous) diffusion
coefficient and H is the Hurst exponent14.
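
The two-step generation procedure (finite-memory fGn followed by summation) can be sketched in Python as follows. This is a minimal illustration rather than the authors' code: the kernel weights are one straightforward discretisation of the integral definition of fBm and should be read as an assumption, as should the function names, the noise standard deviation `sigma_R` and the parameter values used in the example.

```python
import math
import numpy as np

def fgn_finite_memory(n_steps, H, M, rng, sigma_R=1.0):
    """Fractional Gaussian noise as a weighted sum over M past white-noise values."""
    # w[k-1] is the weight for lag k = i - j; lag 1 enters with unit weight.
    w = np.ones(M)
    k = np.arange(2, M + 1)
    w[1:] = k ** (H - 0.5) - (k - 1) ** (H - 0.5)
    w /= math.gamma(H + 0.5)
    # White noise, including the M values preceding the first step.
    R = rng.normal(0.0, sigma_R, size=n_steps + M)
    # Increment i uses R[i : i + M], with the most recent value last.
    return np.array([np.dot(w[::-1], R[i:i + M]) for i in range(n_steps)])

def fbm_cloud(n_particles, n_steps, H, M, rng):
    """Each row is an independent fBm trace starting at zero (cumulative sum of fGn)."""
    inc = np.array([fgn_finite_memory(n_steps, H, M, rng) for _ in range(n_particles)])
    return np.hstack((np.zeros((n_particles, 1)), np.cumsum(inc, axis=1)))

rng = np.random.default_rng(0)
cloud = fbm_cloud(n_particles=200, n_steps=100, H=0.8, M=500, rng=rng)
sigma_c = cloud.std(axis=0)  # ensemble standard deviation at each time step
```

For H = 0.5 all weights beyond lag 1 vanish and the sketch reduces to an ordinary random walk. A two-dimensional trajectory is obtained by calling `fbm_cloud` once per spatial co-ordinate.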

Figure 1: Clouds of 20 fBm Traces, H = 0.2, 0.5, 0.8. (Note: Different Scales on
Vertical Axes.)

Fig. 1 shows the diffusion of clouds of fBm (H = 0.2, 0.5 and 0.8) from a point source. The
respective subdiffusive and superdiffusive nature of the H = 0.2 and 0.8 fBm is evident in
the plot. In order to generate the fBm in two (or more) spatial directions an fBm trace is
used for each independent co-ordinate. Figure 2 contains 2D fBm spatial trajectories for
single particles. Again the subdiffusive nature of the H = 0.2 path and superdiffusive nature
of the H = 0.8 path can be seen. The particle tracking model works by releasing many
thousands of particles and utilising their ensemble spreading behaviour.




Figure  2:  Two-Dimensional  fBm  trajectories. 

The finite memory, M, of the synthesised fBm is an important factor that must be taken
into account when generating the fBm using the above method. Memories that are too short
result in a poor representation of the fBm over the problem timescale. Figure 3 shows the
effect  of  low  memory  on  the  resulting  power  law  of  the  diffusing  cloud.  In  the  figure  a  100 
step  fBm  is  generated  using  memories  from  100  to  2000  steps,  i.e.  from  one  times  the 
length  of  each  component  trace  to  twenty  times  the  length  of  each  trace.  The  gradual 
straightening  of  the  curve  is  evident  from  the  plot  indicating  that  the  synthesised  fBm 
realises  the  scaling  power  law  (Eq.  3)  as  the  memory  increases.  The  authors  have  found 
that  a  memory  of  approximately  5  times  the  problem  lengthscale  is  suitable  for  most 
practical  applications. 
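
One way to judge whether a chosen memory is long enough is to fit the power law of Eq. (3) to the ensemble standard deviation and compare the fitted exponent with the target H. The sketch below is an illustration under stated assumptions: the function name `fit_hurst` is hypothetical, and the input is an exact synthetic power law (with an assumed K_f = 0.3) rather than simulation output.

```python
import numpy as np

def fit_hurst(times, sigma):
    """Least-squares fit of sigma = (2 K t)^H on log-log axes; returns (H, K)."""
    slope, intercept = np.polyfit(np.log(times), np.log(sigma), 1)
    H = slope
    # ln(sigma) = H ln(t) + H ln(2 K)  =>  K = exp(intercept / H) / 2
    K = 0.5 * np.exp(intercept / H)
    return H, K

# Synthetic check: an exact power law with H = 0.8 and K = 0.3 (assumed values).
t = np.arange(1, 101, dtype=float)
sigma = (2.0 * 0.3 * t) ** 0.8
H_est, K_est = fit_hurst(t, sigma)
```

Applied to a simulated cloud, a fitted exponent that drifts away from the target H at later times would indicate that M is too short for the problem timescale.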


Figure 3: $\sigma_c^{1/H}$ versus time for various values of memory M.

3  Dispersion  in  a  Coastal  Bay 

Fig. 4 shows a numerically generated surface velocity field for a coastal bay with a
North-South flow. This main flow causes a recirculation zone within the bay itself. In such
complex flow-fields the relative change in the velocity vectors (magnitude and direction)
can cause additional dispersion of the pollutant cloud to take place. This effect is known
as shear dispersion.




Figure  4:  Coastal  Bay  Model,  Surface  Velocity  Vector  Plot. 

Fig. 5 compares two synthesised pollutant clouds, released in the bay, as
they are dispersed over time. The left-hand plot of each pair contains the
isoconcentration contours of a Fickian cloud (H = 0.5) and the right-hand plot those of a
superdiffusive (H = 0.8) cloud. Within the bay itself we see that the non-Fickian cloud
spreads  much  more  rapidly  than  the  (traditional)  Fickian  cloud.  This  rapid  spreading  can 
result  in  areas  being  affected  that  would  otherwise  escape  contamination  with  the  Fickian 
model.  In  addition,  lower  concentrations  are  reached  much  sooner  with  the  superdiffusive 
cloud  but  a  larger  area  is  affected.  This  has  implications  for  the  modelled  biological 
impact  of  the  pollutant  on  the  environment,  especially  for  those  organisms  whose 
mortality  depends  on  a  threshold  concentration  of  contaminant. 
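
The conversion of a particle cloud into a concentration field, and the area exceeding a threshold concentration, can be sketched as below. The equal-mass-per-particle assumption follows the description of the particle tracking model; the function names, grid and particle positions are hypothetical and purely illustrative.

```python
import numpy as np

def concentration_grid(x, y, total_mass, x_edges, y_edges):
    """Bin particle positions; each particle carries an equal share of the mass."""
    counts, _, _ = np.histogram2d(x, y, bins=[x_edges, y_edges])
    cell_area = np.outer(np.diff(x_edges), np.diff(y_edges))
    return counts * (total_mass / x.size) / cell_area

def area_above(conc, x_edges, y_edges, threshold):
    """Total area of the grid cells whose concentration is at or above the threshold."""
    cell_area = np.outer(np.diff(x_edges), np.diff(y_edges))
    return float(cell_area[conc >= threshold].sum())

# Four particles on a 2 x 2 grid of unit cells (illustrative values only).
x = np.array([0.5, 0.5, 0.5, 1.5])
y = np.array([0.5, 0.5, 0.5, 0.5])
edges = np.array([0.0, 1.0, 2.0])
conc = concentration_grid(x, y, total_mass=4.0, x_edges=edges, y_edges=edges)
affected = area_above(conc, edges, edges, threshold=2.0)
```

Comparing `affected` for a Fickian and a superdiffusive cloud at the same time gives a simple measure of the difference in impacted area discussed above.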

Figure 5: Comparison of Fickian (H = 0.5) and Non-Fickian (H = 0.8) Contaminant
Clouds Released from (1300, 2800) at Time Zero. Plots show location of cloud after 1, 2,
3, 4, 7, 10 hours.


4  Concluding Remarks

The  non-Fickian  particle  tracking  diffusion  model  developed  by  the  authors  allows  for  a 
more  flexible  approach  than  currently  available  to  the  modelling  of  contaminant  transport 
within turbulent flowfields. The model has been illustrated using a coastal bay flow-field;
however, it has general applications to non-Fickian diffusive processes in one, two or
three dimensions. (The authors have developed a version for subsurface diffusion through
material of variable hydraulic conductivities12,15.) In the study detailed above, the effects
of shear dispersion, wind shear and tidal motions were neglected. In a fully working
model these important dispersive processes would have to be taken into account.

One drawback of the model is that H is restricted to vary between 0 and 1. However, it
is known that diffusive processes on the open ocean (i.e. far from boundaries) can scale
with Hurst exponents up to 1.5. It is possible that by incorporating shear dispersion,
superdiffusive processes may be realised with effective exponents above unity. Another
drawback  of  the  model  is  that  the  fBm  generation  method,  given  by  Eqs.  (2a)  and  (2b), 
requires  a  heavy  computational  effort.  This  is  because  each  fGn  requires  a  summation 
over  M  time  steps,  where  the  memory  M  must  be  of  the  order  of  (at  least)  five  times  the 
duration  of  the  problem  under  investigation  to  ensure  accurate  non-Fickian  statistics  over 
the  problem  timescale.  The  authors  have  recently  developed  faster  spectral  methods  of 
generating fBm based on the work of Yin16. Current work by the authors concentrates on
both the effect of shear dispersion on fBm dispersive processes and the
application of the fBm particle tracking model to a fully three-dimensional numerical
model with experimental verification.


STOCHASTIC  SUBSURFACE  FLOW  AND  TRANSPORT  IN  FRACTAL

Monte Carlo simulations of subsurface flow and contaminant transport of a non-reactive solute
plume by steady-state flow with a uniform velocity were performed in a two-dimensional synthetic
heterogeneous porous medium whose hydraulic conductivity is non-stationary and described by
multi-scale fractional Brownian motion. Analysis of the flow and transport results indicates that the
longitudinal  velocity  variance  is  nearly  constant  in  the  longitudinal  direction  while  in  the  transverse 
direction  it  assumes  a  parabolic  shape.  The  velocity  variance  is  maximum  at  the  impervious 
boundaries  and  decreases  in  transverse  direction  with  distance  from  the  boundaries  reaching  the 
minimum  value  at  the  domain  centre.  We  observe  that  the  particle  displacement  covariance  is 
anomalous  or  non-Fickian  at  all  times  t  in  the  dispersion  process  irrespective  of  the  Hurst  exponent 
H  and  grows  temporally  faster  than  linearly. 


1  Introduction 

Natural  variability  of  subsurface  (i.e.,  hydrologic  and  geologic  systems)  hydraulic 
properties  is  the  main  factor  controlling  the  flow  and  spreading  of  contaminants  in 
subsurface  porous  media.  The  primary  physical  property  which  exhibits  large-scale 
natural  spatial  variability  is  the  hydraulic  conductivity.  One  important  source  of  error  in 
numerical  models  to  simulate  flow  and  transport  in  the  subsurface  stems  from  lack  of 
knowledge  concerning  the  spatial  variability  of  hydraulic  conductivity.  In  practice,  we 
typically  have  sparse  data  which  is  inadequate  to  completely  describe  the  spatial 
variability.  This  is  a  consequence  of  the  high  data  collection  cost  required  to  fully 
characterise  a  given  subsurface  porous  medium.  Hence,  the  variation  of  subsurface 
hydraulic  conductivity  cannot  generally  be  described  in  all  its  detail  and  is  therefore 
uncertain1. 

One  response  to  the  problem  of  the  uncertainty  of  subsurface  hydraulic  conductivity  has 
been  an  interest  in  stochastic  methods,  which  provide  a  formal  framework  for  the 
treatment  of  uncertainty.  Spatial  variability  of  subsurface  properties  is  such  that  their 
unique,  deterministic,  description  is  not  feasible2,  and  this  is  formally  recognised  in  the 
stochastic  approach,  in  which  subsurface  hydraulic  conductivity  is  commonly  regarded  as 
a  random  scalar  field  (RSF). 

Stochastic  theories  involve  the  description  of  the  local  porous  medium  structure  using  a 
statistical  model  that  requires  a  small  number  of  parameters  to  be  identified  from  field 
measurements.  The  detailed  spatial  distribution  of  logconductivity  is  conventionally 
reduced  to  a  few  statistical  parameters,  for  example,  covariance  function,  mean,  variance, 
and  correlation  length.  It  thus  follows  that  the  outputs  from  stochastic  models  are 
probabilistic,  characterised,  for  example,  by  the  statistical  moments  or  the  full  probability 
density  function  of  the  variable  of  interest. 

In  this  study,  we  perform  Monte  Carlo  experiments  by  generating  a  large  number  of 
independent  random  conductivity  realisations  with  a  fractal  semivariogram  function, 
solving  the  stochastic  boundary  value  problem  by  repetitively  solving  a  set  of 
deterministic flow problems, each of which is an equally probable representation of the
response of the real heterogeneous medium. The flow problem is solved by employing a
block-centred finite difference scheme (e.g., Smith and Freeze3) to obtain subsurface
velocity fields. A random walk particle tracking algorithm is used to solve the transport
equation (e.g.,4-7) using realised velocity fields. Realised solutions are averaged to get
fluctuating log conductivity semivariograms, velocity variances, particle displacement
covariance and mean solute concentrations.


2  Random  Field  Model 

In  the  Monte  Carlo  simulation  of  subsurface  flow  and  transport,  the  first  step  is  the  choice 
of  the  statistical  model  that  represents  the  medium  heterogeneity,  mainly  the  log  hydraulic 
conductivity, $\ln K(\mathbf{x})$. Most stochastic theoretical and numerical modelling approaches
(see e.g.7-9) assume that the log hydraulic conductivity $Y(\mathbf{x}) = \ln K(\mathbf{x})$, where $K(\mathbf{x})$ is the
local hydraulic conductivity of the subsurface porous medium and $\mathbf{x}$ is the vector of
spatial co-ordinates, is a stationary random scalar field, normally distributed and
characterised by a constant mean $\langle Y \rangle$ and an isotropic exponential covariance function, e.g.,
$C_Y(r) = \sigma_Y^2 e^{-r/\lambda}$ (with $r = |\mathbf{r}|$, where $\mathbf{r}$ is the two-point isotropic lag, $\sigma_Y^2$ is the variance
of the distribution, and $\lambda$ the finite integral scale). The justification for using this model
is based on geostatistical data obtained from several sites (e.g.,10). In addition, the
macrodispersivity measured from tracer tests agrees reasonably well with the stochastic
theory results based on the exponential model. Zhan and Wheatcraft11 argue that field
hydraulic conductivity measurements are limited, have large uncertainty and have been
carried out for relatively small spatial scales (at most a kilometre). If measurements
were available at much larger spatial scales, conductivity values might remain correlated at
larger scales and give rise to non-stationary fractal or self-similar distributions12-14
with infinite correlation length. Neuman15 proposes a spatial scaling assumption with a
semivariogram function given in (1) to describe such a distribution as

$\gamma_Y(r) = \frac{1}{2} \left\langle \left[ Y(\mathbf{x} + \mathbf{r}) - Y(\mathbf{x}) \right]^2 \right\rangle = \gamma_0 (r/\lambda)^{2H}$ ,  (1)

where $\lambda$ is a reference scale, $\gamma_0$ is a constant variance parameter, and the scaling
exponent H is the Hurst exponent16. Equation (1) has been demonstrated to be valid for
self-affine stochastic processes over the broad range $0 < H < 1$ (ref. 24). The exponent H is
associated with the fractal dimension $D_s = E + 1 - H$, where E is the space
dimensionality. For $0 < H < 1$ the variance and integral scale of the field Y grow without bound,
thus describing a medium with spatially evolving heterogeneity. The semivariogram,
given by (1), shows a continuous growth with distance. The presence of more than one
reference scale of variability has an influence on fluid flow and contaminant migration6.
Spatially evolving scale formations are characterised by a variance which keeps growing
with the domain size. For a bounded domain, the order of magnitude of the variance is
given from (1) by the following:

$\sigma_Y^2 \propto R^{2H}$ ,  (2)

where R is the characteristic dimension of the domain (upper fractal cut-off scale).
Hence, the log conductivity integral scale $\lambda$ shows the following scaling property:

$\lambda \propto a R$  (3)

where a is a constant that depends on H. For independent realisations of the log
conductivity field, the variance of the random processes varies because of lack of
stationarity. However, averaging over many realisations, the order of magnitude of the
variance is controlled by the above relationship. The spatial variability of the field is
controlled by H while, for a given H, the contrast existing between the actual values of
log conductivity Y is controlled by $\gamma_0$. Increasing $\gamma_0$, keeping H constant, amplifies the
contrast between block conductivities. To perform hydraulic conductivity simulations for
different H values, the same order of magnitude of the fluctuations needs to be
maintained. This was needed in order to obtain realistic fluctuations for the case $H > 1/2$.
From  a  practical  point  of  view  we  impose  in  all  cases  the  following  condition: 

$\gamma_0 R^{2H} = \sigma_Y^2$ ,  (4)


where $\sigma_Y^2$ is the variance of the field at the maximum size of the modelled domain (upper
cut-off scale). This means that once we fix the value of $\gamma_0$ for the case $H < 1/2$, say $[\gamma_0]_1$ for
$H = H_1$, the coefficient $[\gamma_0]_2$ for the case $H = H_2 > 1/2$ becomes:


$[\gamma_0]_2 = [\gamma_0]_1 \frac{R^{2H_1}}{R^{2H_2}}$ .  (5)
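
The normalisation of the variance parameter across Hurst exponents can be illustrated with a short Python sketch. It assumes that R is expressed in units of the reference scale, so that the variance at the cut-off is gamma0 * R**(2H); the function names and numerical values are hypothetical.

```python
def power_law_semivariogram(r, gamma0, lam, H):
    """Power-law (fractal) semivariogram gamma0 * (r / lam)**(2H)."""
    return gamma0 * (r / lam) ** (2.0 * H)

def rescale_gamma0(gamma0_1, H1, H2, R):
    """gamma0 for exponent H2 that gives the same variance at the cut-off scale R
    as gamma0_1 does for exponent H1 (R in units of the reference scale)."""
    return gamma0_1 * R ** (2.0 * H1) / R ** (2.0 * H2)

# Illustrative values: fix gamma0 for H1 = 0.3, then rescale for H2 = 0.8.
R = 25.0
gamma0_1 = 0.05
gamma0_2 = rescale_gamma0(gamma0_1, H1=0.3, H2=0.8, R=R)
var_1 = power_law_semivariogram(R, gamma0_1, 1.0, 0.3)
var_2 = power_law_semivariogram(R, gamma0_2, 1.0, 0.8)
```

Because R is larger than the reference scale, the rescaled coefficient for the larger exponent is smaller, which keeps the fluctuations at the domain scale comparable between the two cases.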


Field  measurements  of  hydraulic  conductivity  have  indicated  an  approximately  log-normal 
distribution10.  The  same  distribution  has  therefore  been  adopted  for  the  simulations 
generated in this study. Essentially, a normally distributed log conductivity field $Y(x,y)$ with
stationary increments and the isotropic semivariogram given in (1) is generated over a
two-dimensional domain, and the hydraulic conductivity field is obtained by the transformation

$K(x,y) = \exp[Y(x,y)]$ ,  (6)

where $K(x,y)$ is the conductivity assigned to the point $(x,y)$ of the domain. The turning
band method19,20 was used to simulate the hydraulic conductivity fields. Using this
approach we do not need to filter the wavelengths larger than the field dimension
(fractal cut-off scale) from the Y spectrum. The larger scale of variability is limited by
the field dimension, and the cut-off is introduced by the fact that the generation is
performed in a finite domain, similar to the work of Bellin et al.6. Since the log
conductivity field Y is non-stationary, the mean value of Y is maintained constant in each
realisation of the field Y by conditioning the field to a given constant value21. This
translates to conditioning the mean velocity field to a given constant value. Figures 1a
and 1b show two realised fractal hydraulic conductivity fields generated using the
turning bands method. The simulated fields pertain to H = 0.3 and H = 0.8, respectively.
These fields show dramatically different behaviour. When H > 1/2, Y is positively
correlated and the field shows smooth variations and large-scale persistence22 of the
positive and negative values (Fig. 1b). Conductivity values tend to cluster above or below
the mean for quite some distance before they change to the other side of the mean. When
H < 1/2, Y is negatively correlated, which leads to less persistence within the conductivity
values, and hence Y varies erratically (Fig. 1a).


(a)  H  =  0.3  (b)  H  =  0.8 

Figure  1:  Fractal  Hydraulic  Conductivity  Fields  Simulated  by  Turning  Bands  Method. 

To  address  model  accuracy,  we  consider  the  following  statistic: 


$\hat{\gamma}_Y(r) = \frac{1}{2N} \sum_{i=1}^{N} \left[ Y(\mathbf{x}_i) - Y(\mathbf{x}_i - \mathbf{r}) \right]^2$  (7)

where N denotes the number of pairs of log hydraulic conductivity Y with separation
equal to r. $\hat{\gamma}_Y(r)$ is the semivariogram obtained by spatially averaging all the equidistant
pairs in a single realisation over a single replicate and assuming stationarity. The
reconstructed expected values of the log conductivity semivariograms for H = 0.3 and 0.8 are
shown in Figures 2a and 2b, respectively. The expected values are computed in a Monte
Carlo sense, averaging over a number MC of independent realisations, as

$\gamma_{YC}(r) = \langle \hat{\gamma}_Y(r) \rangle = \frac{1}{MC} \sum_{j=1}^{MC} \hat{\gamma}_Y^{(j)}(r)$  (8)

according  to  the  following  convergence  criteria6 


(9) 
where $r_k$ is the distance of the two-point lag along the North-South (NS) and East-West
(EW) directions of the modelled domain and M is the number of points used to discretise
the semivariogram. The simulated and theoretical semivariograms (Eq. (1)) along the NS
(squares) and EW (triangles) directions agree well for both H values in
Fig. 2, although the accuracy deteriorates with lag. The plotted semivariograms show that
the variance grows without bound with lag distance for a medium described by the
semivariogram given in Eq. (1).
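
A directional estimate of the semivariogram on a regular grid, and its Monte Carlo average, might be computed as in the following sketch. The array layout (axis 0 taken as NS, axis 1 as EW), the function names and the test field are assumptions made for illustration only.

```python
import numpy as np

def directional_semivariogram(Y, max_lag, axis):
    """gamma(r) = 1/(2N) * sum of squared differences of all pairs separated
    by r = 1..max_lag grid cells along the given axis."""
    Y = np.asarray(Y, dtype=float)
    n = Y.shape[axis]
    gam = np.empty(max_lag)
    for lag in range(1, max_lag + 1):
        a = np.take(Y, np.arange(0, n - lag), axis=axis)
        b = np.take(Y, np.arange(lag, n), axis=axis)
        gam[lag - 1] = 0.5 * np.mean((b - a) ** 2)
    return gam

def monte_carlo_semivariogram(fields, max_lag, axis):
    """Average of the single-realisation estimates over all realisations."""
    return np.mean([directional_semivariogram(Y, max_lag, axis) for Y in fields], axis=0)

# Check on a field with a unit linear trend along axis 0 only.
Y_trend = np.arange(5.0)[:, None] * np.ones((1, 4))
g_ns = directional_semivariogram(Y_trend, 3, axis=0)
g_ew = directional_semivariogram(Y_trend, 3, axis=1)
```

The number of available pairs falls as the lag increases, which is one reason why the agreement with the theoretical curve tends to deteriorate at large lags.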


Figure 2: Simulated $\gamma_{YC}(r)$ and theoretical $\gamma_Y(r) = \gamma_0 (r/\lambda)^{2H}$ semivariograms along NS and EW for fields
in Figure 1.


3  Stochastic  Finite  Difference  Flow  Model 

Steady-state  confined  flow  of  groundwater  in  a  two-dimensional,  horizontal,  saturated, 
incompressible porous medium with physical heterogeneity represented by simulated
spatially varying fractal conductivity fields is considered in this study. Flow simulations
were performed in a Monte Carlo manner for fractal fields of various H values. The
simulation domain is a square domain where constant hydraulic heads are assigned at the
left and right boundaries and the no-flow condition on the top and bottom to create a
uniform mean velocity v. The numerical simulations are performed for a fixed domain of
$25\lambda \times 25\lambda$. Bellin et al.1 suggest that flow and transport solutions are not affected by
refinements of the grid size involving more than four points per integral scale $\lambda$. Here, we
choose the following grid sizes in the longitudinal and transverse directions,
$\Delta x = \Delta y = \lambda/4$.

Steady-state  subsurface  flow  in  the  2-d  domain  is  described  by 

$\nabla \cdot \mathbf{q}(\mathbf{x}) = \nabla \cdot \left( -K(\mathbf{x}) \nabla \phi(\mathbf{x}) \right) = 0$  (10)


where $\mathbf{q}(\mathbf{x})$ is the Darcy flux relative to the solid matrix and $\phi(\mathbf{x})$ is the hydraulic head.

The flow equation is discretised by employing a mesh-centred finite difference scheme.
The hydraulic heads at the nodal points are solved with the boundary conditions given
above by a Gauss-Seidel method with successive over-relaxation (SOR). In order to
compute  nodal  heads,  the  interblock  hydraulic  conductivities  are  computed  by  geometric 
averaging of adjacent conductivity values. This form of averaging ensures continuity of
the head field and conservation of mass flux across block boundaries17. Darcy's law is
applied to obtain the flux components $\mathbf{q}(\mathbf{x})$ and nodal velocities are computed by finite
differencing with a constant porosity value of $\eta = 0.3$.
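
A compact illustration of this type of solver is sketched below. It is not the authors' implementation: the fixed heads are placed at the first and last columns of nodes, the grid spacing is uniform, and the relaxation factor, tolerance, grid size and function names are assumed values. The second function evaluates a percentage mass balance error from the inflow and outflow.

```python
import numpy as np

def solve_heads_sor(K, h_left, h_right, omega=1.5, tol=1e-10, max_iter=10000):
    """Gauss-Seidel with SOR for steady flow; fixed heads on the left/right
    columns, no-flow top and bottom, geometric-mean interblock conductivities."""
    ny, nx = K.shape
    Kx = np.sqrt(K[:, :-1] * K[:, 1:])  # between columns j and j + 1
    Ky = np.sqrt(K[:-1, :] * K[1:, :])  # between rows i and i + 1
    h = np.tile(np.linspace(h_left, h_right, nx), (ny, 1))
    for _ in range(max_iter):
        max_change = 0.0
        for i in range(ny):
            for j in range(1, nx - 1):
                num = Kx[i, j - 1] * h[i, j - 1] + Kx[i, j] * h[i, j + 1]
                den = Kx[i, j - 1] + Kx[i, j]
                if i > 0:  # a missing neighbour means no flow across that face
                    num += Ky[i - 1, j] * h[i - 1, j]
                    den += Ky[i - 1, j]
                if i < ny - 1:
                    num += Ky[i, j] * h[i + 1, j]
                    den += Ky[i, j]
                change = omega * (num / den - h[i, j])
                h[i, j] += change
                max_change = max(max_change, abs(change))
        if max_change <= tol:  # stop when no head changes by more than tol
            break
    return h, Kx

def mass_balance_error(h, Kx):
    """Percentage difference between outflow (right) and inflow (left)."""
    q_in = np.sum(Kx[:, 0] * (h[:, 0] - h[:, 1]))
    q_out = np.sum(Kx[:, -1] * (h[:, -2] - h[:, -1]))
    return (q_out - q_in) / q_in * 100.0

rng = np.random.default_rng(1)
K = np.exp(rng.normal(size=(6, 8)))  # small log-normal test field
h, Kx = solve_heads_sor(K, h_left=1.0, h_right=0.0)
err = mass_balance_error(h, Kx)
```

Loosening `tol` in this sketch increases the mass balance error, which mirrors the role of the convergence criterion in the iterative scheme.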

In  order  to  check  the  accuracy  and  efficiency  of  the  finite  difference  approximation  of 
the  flow  equation,  we  calculated  the  local  mass  balance  error  which  can  be  defined18  as 

$\varepsilon = (Q_{out} - Q_{in}) / Q_{in} \times 100\%$  (11)

where $Q_{out}$ and $Q_{in}$ are the total volume of flow out of the right boundary and into the left
boundary, respectively. The mass balance error decreases when a smaller value of the
convergence criterion, $\mu$, is used in the iterative scheme (iteration stops when the head change
at every node between two successive iterations is less than or equal to $\mu$). We found that for each
Monte Carlo simulation $\varepsilon$ is always less than 4% for each Hurst exponent H. These
results show that the finite difference solution is reasonably accurate and, accordingly, that the
obtained velocity field is sufficiently accurate to be used in the transport model. The mean
velocity computed for each Hurst exponent over 1500 Monte Carlo
realisations was equal to unity, leading to the conclusion that the mean velocity can indeed
be assumed constant and equal to unity in each realisation.


4  Stochastic  Solute  Transport  Model 

The  random  walk  particle  tracking  method  is  commonly  used  in  the  field  of  statistical 
physics  to  model  processes  involving  diffusion.  This  approach  has  also  been  used 
successfully to simulate reactive and non-reactive transport in the subsurface4-7. The basic
idea is to approximate the spatial distribution of a transport quantity by a set of moving
particles. The spatial locations of the particles are updated at each time step according to the
following equation23

$\mathbf{x}(t + \Delta t) = \mathbf{x}(t) + \left[ \mathbf{v}(\mathbf{x},t) + \nabla \cdot \mathbf{d}(\mathbf{v}(\mathbf{x},t)) \right] \Delta t + \left[ 2 \mathbf{d}(\mathbf{v}(\mathbf{x},t)) \Delta t \right]^{1/2} \mathbf{Z}$  (12)


where $\mathbf{x}(t + \Delta t)$ is the updated position of the particle that was at $\mathbf{x}(t)$ in the previous
step, $\mathbf{v}(\mathbf{x},t)$ is the velocity vector at the old position at time t, $\mathbf{d}$ is the local scale
dispersion tensor, $\Delta t$ is the time step, and $\mathbf{Z}$ is a vector of normally distributed random
numbers  of  zero  mean  and  unit  variance.  The  second  term  on  the  right-hand  side  moves 
the  particle  advectively  on  the  basis  of  the  local  velocity  field  at  each  point.  The  third  term 
is  important  when  stagnation  regions  or  sharp  fronts  exist  within  the  field23.  The  last  term 
accounts  for  the  local  scale  dispersion.  The  particle  velocity  needed  in  (12)  is  obtained  by 
using  a  bilinear  interpolation  utilising  the  four  velocity  values  surrounding  the  particle 
position. Eq. (12) is used to track 2000 non-interacting particles initially distributed along a
strip of length $6\lambda$ normal to the mean flow direction of the domain. A constant
dimensionless time step $\Delta \bar{t} = 0.05$ was chosen such that the fraction of the cell's length
traversed by a particle in a single time step is much smaller than one. The dimensionless
time is given by $\bar{t} = v t / \lambda$. The particle tracking experiments are performed in the inner
core  of  the  flow  domain  that  is  not  affected  by  the  boundaries.  Particle  tracking  is 
terminated  during  each  Monte  Carlo  run  before  the  contaminant  plume  exits  the  inner  core 
of  the  modelled  domain. 
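
A single update of the particle positions, with bilinear interpolation of the nodal velocities, can be sketched as follows. The sketch makes a simplifying assumption that is not in the text: the local dispersion is taken as a constant isotropic coefficient D, so the gradient term in Eq. (12) vanishes. The function names, grid layout (fields indexed [iy, ix]) and values are hypothetical.

```python
import numpy as np

def bilinear(field, x, y, dx):
    """Interpolate a nodal field (indexed [iy, ix], uniform spacing dx) at points (x, y)."""
    ny, nx = field.shape
    fx = np.clip(x / dx, 0.0, nx - 1 - 1e-9)
    fy = np.clip(y / dx, 0.0, ny - 1 - 1e-9)
    ix, iy = fx.astype(int), fy.astype(int)
    tx, ty = fx - ix, fy - iy
    return ((1 - tx) * (1 - ty) * field[iy, ix] + tx * (1 - ty) * field[iy, ix + 1]
            + (1 - tx) * ty * field[iy + 1, ix] + tx * ty * field[iy + 1, ix + 1])

def random_walk_step(x, y, vx, vy, dx, dt, D, rng):
    """Advect with the interpolated velocity and add a Gaussian dispersive jump
    (constant isotropic D assumed, so the divergence term is zero)."""
    u = bilinear(vx, x, y, dx)
    v = bilinear(vy, x, y, dx)
    amp = np.sqrt(2.0 * D * dt)
    x_new = x + u * dt + amp * rng.standard_normal(x.size)
    y_new = y + v * dt + amp * rng.standard_normal(y.size)
    return x_new, y_new

# Linear test field: bilinear interpolation reproduces it exactly.
iy, ix = np.mgrid[0:4, 0:5]
vx = 2.0 * ix + 3.0 * iy
vy = np.zeros_like(vx)
x0, y0 = np.array([0.5]), np.array([0.25])
u0 = bilinear(vx, x0, y0, 1.0)
x1, y1 = random_walk_step(x0, y0, vx, vy, 1.0, 0.1, 0.0, np.random.default_rng(0))
```

With a velocity-dependent dispersion tensor the gradient term would have to be evaluated as well, for example by finite differencing the dispersion coefficients on the grid.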


5  Results  from  Flow  and  Transport  Stochastic  Analyses 

Numerical  generation  of  synthetic  hydraulic  conductivity  fields,  flow  computations  and 
transport  simulations  were  carried  out  in  a  Monte  Carlo  manner  for  each  Hurst  exponent 
H. The simulation was terminated when the convergence criteria given in (9) were
satisfied.  For  each  Hurst  exponent  at  least  1500  Monte  Carlo  runs  were  required  to  attain 
convergence.  Numerical  analyses  were  then  carried  out  to  calculate  mean  statistical 
quantities  of  interest  from  the  flow  and  transport  simulations. 

Bellin et aln show that the presence of boundaries influences the hydraulic head and
velocity variabilities in two-dimensional domains. In Fig. 3, we plot results for the head
variances $\sigma_h^2$ for different values of H averaged over 1500 Monte Carlo realisations.
The head variability assumes a parabolic shape for each value of H. The parabolic shape
of the curves reflects the statistical heterogeneity of the standard deviation in hydraulic
head due to the constant head values specified on the boundaries of the flow domain. The
head variance is zero at the fixed boundaries, as expected, and increases towards the centre of the
domain, where it is maximum. The variance of hydraulic head, $\sigma_h^2$, increases with H.


Figure 3: Head Variance $\sigma_h^2$ along the Longitudinal Axis for different values of H.


Figs. 4a and 4b show the average Monte Carlo longitudinal velocity variance $\sigma_u^2$ along
the centre line in the longitudinal direction (dashed curve), and also the centre line along
the transverse direction (solid curve), for H = 0.4 and 0.8 respectively.

Figure 4: Velocity Variance $\sigma_u^2$ Along the Centreline in the Longitudinal (Dashed) and
Transverse (Solid) Direction.

While in the longitudinal direction $\sigma_u^2$ is nearly constant, in the transverse direction it
assumes a parabolic shape. The longitudinal velocity variance is maximum at the
impervious boundaries and decreases in the transverse direction with distance from the
boundaries, reaching its minimum value at the domain centre.

Travel time statistics are important7 because they are robust in characterising the
dispersion process, blending all sources of uncertainty into a unique curve. Fig. 5
illustrates travel time statistical distributions (breakthrough curves) at three distinct
absorbing barriers placed normal to the mean flow at distances $5\lambda$, $10\lambda$ and
$15\lambda$ respectively, computed by counting the number of particles that cross each barrier at each
transport simulation time. The figure is the average of 1500 Monte Carlo simulations for
the case H = 0.7. This result shows that as the plume travels further from the contaminant
source, the increased spreading of the plume results in both a corresponding attenuation
and reduction in peak value of the breakthrough curves.
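
The counting procedure behind a breakthrough curve can be sketched as follows. This is only an illustrative sketch under stated assumptions: the function name `breakthrough_curve`, the representation of each particle path as a list of longitudinal positions at successive time steps, and the normalisation by the number of particles are choices made here, not details given in the paper.

```python
def breakthrough_curve(paths, barrier_x, n_steps):
    """Fraction of particles that first cross x = barrier_x in each time step.

    paths: list of particle paths (hypothetical layout); each path is a list
           of longitudinal positions, index k corresponding to time k * dt.
    """
    counts = [0] * n_steps
    for path in paths:
        for k, x in enumerate(path[:n_steps]):
            if x >= barrier_x:
                # Absorbing barrier: only the first crossing is counted.
                counts[k] += 1
                break
    return [c / len(paths) for c in counts]
```

Averaging such curves over all Monte Carlo realisations would give curves of the kind described for Fig. 5.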


Figure 5: Travel Time Statistics for H = 0.7.


At the end of each Monte Carlo run, the moments of the dispersing plume are computed as
follows: (a) the trajectory of the plume centroid:

$$R_i^m(t) = \frac{1}{NP} \sum_{p=1}^{NP} X_i^{m,p}(t) \qquad (13)$$

where $X_i^{m,p}(t)$ ($i = 1, 2$) are the coordinates of the p-th particle for the realisation m and
NP is the total number of particles; (b) the second-order central plume moment:

$$S_{ij}^m(t) = \frac{1}{NP} \sum_{p=1}^{NP} \left[X_i^{m,p}(t) - R_i^m(t)\right]\left[X_j^{m,p}(t) - R_j^m(t)\right] \qquad (14)$$

The moments (13) and (14) represent the overall plume behaviour in each one of the
(equally probable) log conductivity fields. Average trajectories $\langle R_i(t) \rangle$ and the average of
the second spatial moments $\langle S_{ij}(t) \rangle$ are obtained by taking expectations over MC
independent Monte Carlo realisations:

$$\langle R_i(t) \rangle = \frac{1}{MC} \sum_{m=1}^{MC} R_i^m(t) \qquad (15)$$

$$\langle S_{ij}(t) \rangle = \frac{1}{MC} \sum_{m=1}^{MC} S_{ij}^m(t) \qquad (16)$$
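
The moment calculations (13) to (16) can be sketched in plain Python as below. This is a minimal sketch rather than the authors' code: the function names `plume_moments` and `ensemble_average`, the list-of-tuples data layout and the 1/NP normalisation of the second moment are assumptions made for illustration.

```python
def plume_moments(particles):
    """Centroid R_i and second central moments S_ij for one realisation.

    particles: list of (x1, x2) particle positions at a fixed time.
    The 1/NP normalisation of S_ij is an assumption of this sketch.
    """
    n = len(particles)
    centroid = [sum(p[i] for p in particles) / n for i in range(2)]
    second = [
        [sum((p[i] - centroid[i]) * (p[j] - centroid[j]) for p in particles) / n
         for j in range(2)]
        for i in range(2)
    ]
    return centroid, second


def ensemble_average(realisations):
    """Average the centroids and second moments over Monte Carlo realisations."""
    mc = len(realisations)
    results = [plume_moments(r) for r in realisations]
    mean_r = [sum(res[0][i] for res in results) / mc for i in range(2)]
    mean_s = [
        [sum(res[1][i][j] for res in results) / mc for j in range(2)]
        for i in range(2)
    ]
    return mean_r, mean_s
```

The quantity plotted against dimensionless time would then be the (1, 1) entry of the averaged second moment minus its initial value.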


Figure 6: Particle Displacement Covariances $\langle S_{11}(t) \rangle - S_{11}(0)$ against $t' = vt/\lambda$ for different values of H.


The expected values (15) and (16) represent the most probable estimate of the actual
plume behaviour. Fig. 6 represents the average of 1500 simulated longitudinal second
spatial moments $\langle S_{11}(t) \rangle - S_{11}(0)$ as a function of the dimensionless time $t' = vt/\lambda$ for
various values of H. Several observations can be made. As the value of the Hurst
exponent increases, the second spatial moment grows more strongly with dimensionless time, a result
that is consistent with the findings of Hassan et al.5. For each value of the Hurst exponent
the second spatial moment increases non-linearly (non-Fickian) for all simulation times.
This result can be explained by the fact that as the solute migrates in the porous medium,
it continuously encounters spatially evolving scales of heterogeneity (fractal), hence the
dispersion remains non-Fickian for all length scales below the scale of heterogeneity of
the modelled domain (upper fractal cut-off scale R).


6  Summary 

A  two-dimensional  numerical  fractal-based  Monte  Carlo  simulation  model  is  developed  to 
study  flow  and  transport  in  heterogeneous  media  of  spatially  evolving  heterogeneity. 
Realisations  of  the  log  fluctuating  conductivity  field  are  generated  from  a  fractal  or  non- 
stationary  distribution.  The  long  range  correlation  of  the  hydraulic  conductivity  field  is 
evident  in  the  numerically  obtained  semivariograms.  The  flow  simulations  are  carried  out 
by  solving  a  finite  difference  equation  over  a  two-dimensional  domain  with  uniform  mean 
flow  in  one  direction.  The  velocity  field  obtained  from  the  conductivity  realisations 
exhibits  long  range  correlations.  A  random  walk  particle  tracking  technique  is  used  to 
solve  the  solute  transport  problem  for  non-reactive  solutes.  The  transport  simulations 
indicate  that  higher  Hurst  exponent  H  (more  persistence)  of  log  conductivity  results  in 
more  longitudinal  spreading  of  the  contaminant  plume.  Results  also  show  that  irrespective 
of  the  Hurst  exponent  the  transport  process  is  non-Fickian  at  all  times  for  a  dispersion 
process  below  the  upper  fractal  cut-off  scale  R . 
Iterated Function Systems (IFS) have been widely used for image compression and for generating
fractal objects. Using barycentric coordinates and an extension of the concept, a range of
rendering  methods,  such  as  Lambert,  Gouraud  and  Phong  shading,  can  be  generated  using  IFS.  By 
rendering  a  scattered  collection  of  individual  points  using  z-buffer  and  shadow  buffer,  the 
problems  of  clipping,  hidden  surface  elimination  and  shadow  generation  are  reduced  to  very 
simple  forms.  The  method  is  relatively  simple  to  program,  but  will  not  achieve  the  high  speeds  of 
current  sophisticated  rendering  methods. 


1 Iterated function systems and the chaos game

Iterated  function  systems  (IFS)  have  been  used  to  generate  models  of  fractal  objects1, 2  and 
for image compression3, 4. From the definitions of relatively few functions, often complex
and unique figures can be drawn. Given n contractive transformations {f0, f1, f2, ..., fn-1}
and an arbitrary starting point, a unique image in 2D or object in 3D is created by:

Given an initial point P0
Loop for i = 1 to a large value
    Select function f at random from {f0, f1, ..., fn-1}
    Set point Pi = f(Pi-1)
    Plot point Pi
End loop

This algorithm, often referred to as ‘the chaos game’, generates a Markov chain5 of
points, each new point dependent only on the previous one. Some stray points may be
created initially (avoided by looping a few times before plotting or by setting P0 inside
the object), but points are soon ‘attracted’ into the image or object defined; this ‘attractor’
cannot then be escaped. Contractive transformations reduce the distance between distinct
points. Uniform point density for each transformation is achieved when the probability of
selection is proportional to contraction ratios (the fractional area or volume reduction), but
uniform point density within the image is only achieved if the transformations have
non-overlapping image sets.

The  theoretical  development  of  IFS1,  2  and  most  subsequent  uses  concentrate  on 
affine  transformations,  familiar  to  computer  graphics  experts  as  combinations  of 
translations,  scalings  and  rotations6.  These  standard  rules  have  been  relaxed  in  a  number 
of  studies,  for  example,  Groller7  added  ‘tapering’  and  ‘twisting’  functions,  Frame  and 
Angers8  used  higher  level  polynomials,  Jones  and  Campa9  used  randomised  functions  and 
Jones and Moar10 used functions involving moduli. It has been shown by Hart11 that
‘even the contractivity condition can be weakened to so called eventual contractivity’.

A  particularly  simple  example  of  the  ‘chaos  game’  that  generates  a  Sierpinski  gasket 
or  triangle  (Fig.  1)  is 
Given three vertices V0, V1, V2
Set point P0 = V0
Loop for i = 1 to a large value
    Select vertex Vi at random from {V0, V1, V2}
    Set point Pi = 0.5(Vi + Pi-1)
    Plot point Pi
End loop


Figure  1.  A  Sierpinski  gasket,  based  on  an  equilateral  triangle,  created  by  the  chaos  game  algorithm.  The 
method  produces  similar  results  for  any  form  of  triangle. 


Point Pi is halfway between the previous point Pi-1 and the randomly selected vertex Vi.
Setting P0 = V0 puts the initial point inside the attractor, so no stray points are created.
As well as regular geometric forms, IFS can successfully create plant-like images
exhibiting self similarity2, 10.
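
The Sierpinski form of the chaos game can be sketched in a few lines of Python. This is an illustrative sketch, not the authors' program: the function name `sierpinski_points`, the fixed random seed and the choice to return the points in a list (rather than plot them) are assumptions made here.

```python
import random


def sierpinski_points(vertices, n_points, seed=0):
    """Chaos game on three vertices: repeatedly move halfway to a random vertex."""
    rng = random.Random(seed)
    x, y = vertices[0]          # P0 = V0 lies on the attractor, so no stray points
    points = []
    for _ in range(n_points):
        vx, vy = rng.choice(vertices)
        x, y = 0.5 * (x + vx), 0.5 * (y + vy)
        points.append((x, y))
    return points
```

For example, `sierpinski_points([(0, 0), (1, 0), (0, 1)], 50000)` gives points that can be passed to any plotting routine; none of them should fall inside the open central triangle.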


2  Barycentric  coordinates 

A point P in a triangle V0V1V2 (Fig. 2) can be represented by barycentric coordinates (a,
b) where


P = V0 + a(V1 - V0) + b(V2 - V0),


using P, V0, and so on as position vectors. This is a unique definition of P, with a ≥ 0,
b ≥ 0 and (a + b) ≤ 1.


Figure  2.  Barycentric  coordinates  (a,  b)  identify  an  interior  point  in  a  triangle. 
These conditions are easily seen by identifying barycentric coordinates as a mapping of
the general triangle V0V1V2 to V'0V'1V'2, where V'0 = (0, 0), V'1 = (1, 0) and V'2 = (0, 1),
in a 2D Cartesian space (Fig. 3).


Figure 3. Mapping of the triangle V0V1V2 into a Cartesian space to show conditions on a and b.

a and b are the fractions of edges V0V1 and V0V2 that are traversed in order to reach P from
V0. Rearranging the definition of P gives its position in terms of the triangle vertices as


P = (1 - a - b)V0 + aV1 + bV2,

clearly a weighted mean of V0, V1, V2 with weights (1 - a - b), a and b. Conditions
on a and b show these weights are non-negative and add to one, so P is the centre of mass
of masses (1 - a - b), a and b placed at V0, V1, V2 respectively. This is analogous to the use of
blending functions in spline curves, is independent of the order of selection of vertices and has
implications for triangle shading.
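
The two directions of this representation can be sketched as follows. The function names `from_barycentric` and `to_barycentric` are hypothetical, and the inverse is written for the 2D case only, solving P - V0 = a(V1 - V0) + b(V2 - V0) by Cramer's rule; this is an illustration of the definition rather than code from the paper.

```python
def from_barycentric(a, b, v0, v1, v2):
    """Position P = (1 - a - b)V0 + aV1 + bV2 (works for 2D or 3D tuples)."""
    w0 = 1.0 - a - b
    return tuple(w0 * p0 + a * p1 + b * p2 for p0, p1, p2 in zip(v0, v1, v2))


def to_barycentric(p, v0, v1, v2):
    """Recover (a, b) for a 2D point P by solving P - V0 = a*e1 + b*e2."""
    e1 = (v1[0] - v0[0], v1[1] - v0[1])
    e2 = (v2[0] - v0[0], v2[1] - v0[1])
    d = (p[0] - v0[0], p[1] - v0[1])
    det = e1[0] * e2[1] - e1[1] * e2[0]
    if det == 0:
        raise ValueError("degenerate triangle: vertices are collinear")
    a = (d[0] * e2[1] - d[1] * e2[0]) / det
    b = (e1[0] * d[1] - e1[1] * d[0]) / det
    return a, b
```

A point lies inside (or on the boundary of) the triangle exactly when the recovered values satisfy the conditions on a and b stated above.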


3  The  chaos  game  and  barycentric  coordinates 

The functions implicit in the ‘chaos game’ map points from within the initial triangle to
a half scaled triangle located at the randomly selected vertex. For example, when vertex V0
is selected, Fig. 4 shows the shaded half scale triangle V0W2W1 (Wi is the edge mid
point opposite Vi). In the chaos game, if Pi-1 has barycentric coordinates (a, b),

Pi-1 = (1 - a - b)V0 + aV1 + bV2,

the barycentric coordinates of the subsequent point Pi = 0.5(Vi + Pi-1) can be found for
each selection of Vi.

1. When Vi = V0,

Pi = (1 - 0.5a - 0.5b)V0 + 0.5aV1 + 0.5bV2,
so the barycentric coordinates are (0.5a, 0.5b);

2. when Vi = V1,

Pi = (0.5 - 0.5a - 0.5b)V0 + 0.5(1 + a)V1 + 0.5bV2
= (1 - 0.5(1 + a) - 0.5b)V0 + 0.5(1 + a)V1 + 0.5bV2,
so the barycentric coordinates are (0.5(1 + a), 0.5b);

3. when Vi = V2,

Pi = (0.5 - 0.5a - 0.5b)V0 + 0.5aV1 + 0.5(1 + b)V2
= (1 - 0.5a - 0.5(1 + b))V0 + 0.5aV1 + 0.5(1 + b)V2,
so the barycentric coordinates are (0.5a, 0.5(1 + b)).

Thus, the version of the chaos game that generates a Sierpinski triangle can be rewritten
as:


Given three vertices V0, V1, V2
Set a = 0, b = 0
Loop for i = 1 to a large value
    Select j at random from {0, 1, 2}
    Case
        j = 0: set a = 0.5a, b = 0.5b;
        j = 1: set a = 0.5(1 + a), b = 0.5b;
        j = 2: set a = 0.5a, b = 0.5(1 + b);
    End case
    Set point Pi to have barycentric coordinates (a, b)
    Plot point Pi
End loop

This  is  slightly  more  complicated  than  the  original  form,  but  has  interesting  implications 
discussed  later.  Note  that  multiplication  by  0.5  may  be  implemented  very  efficiently  in 
most  programming  languages. 
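
As a check on the derivation, the barycentric form can be run side by side with the original midpoint form on the same random vertex choices. The sketch below is illustrative only; the names `barycentric_step` and `max_deviation` and the seeded generator are assumptions introduced here.

```python
import random


def barycentric_step(a, b, j):
    """One chaos-game step towards vertex Vj, expressed in barycentric coordinates."""
    if j == 0:
        return 0.5 * a, 0.5 * b
    if j == 1:
        return 0.5 * (1 + a), 0.5 * b
    return 0.5 * a, 0.5 * (1 + b)


def max_deviation(vertices, n_steps, seed=0):
    """Largest coordinate gap between the midpoint form and the barycentric form."""
    rng = random.Random(seed)
    v0, v1, v2 = vertices
    a, b = 0.0, 0.0          # barycentric coordinates of V0
    x, y = v0                # Cartesian position, also starting at V0
    worst = 0.0
    for _ in range(n_steps):
        j = rng.randrange(3)
        a, b = barycentric_step(a, b, j)
        x, y = 0.5 * (x + vertices[j][0]), 0.5 * (y + vertices[j][1])
        bx = (1 - a - b) * v0[0] + a * v1[0] + b * v2[0]
        by = (1 - a - b) * v0[1] + a * v1[1] + b * v2[1]
        worst = max(worst, abs(bx - x), abs(by - y))
    return worst
```

Up to floating-point rounding, the two forms should trace the same sequence of points for any triangle.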


4  Rendering  by  IFS 

4.1 Triangle rendering

Standard rendering algorithms ‘fill’ a polygon with colour. Some algorithms require that
all the polygon faces are reduced to sets of triangles, so the triangle is the essential shape.
The chaos game applied to triangle V0V1V2 maps the original triangle to each of the
sub-triangles V0W2W1, W2V1W0 and W1W0V2, leaving internal triangle W0W1W2 unfilled
(Figs. 1, 5). The ‘collage theorem’2 indicates that a shape is generated by covering it
with copies of itself, so to fill V0V1V2, a fourth transformation is needed to map it into
the half scale congruent triangle W0W1W2. The point P with barycentric coordinates (a,
b) in V0V1V2 should be mapped into P' with the same barycentric coordinates (a, b) in
W0W1W2 (Fig. 5). This sets

P' = (1 - a - b)W0 + aW1 + bW2.

The Wi are the mid points of triangle edges, so

P' = 0.5(1 - a - b)(V1 + V2) + 0.5a(V0 + V2) + 0.5b(V0 + V1),

which can be rearranged as

P' = (1 - 0.5(1 - a) - 0.5(1 - b))V0 + 0.5(1 - a)V1 + 0.5(1 - b)V2,

showing that the barycentric coordinates of P' in the original triangle V0V1V2 are (0.5(1 -
a), 0.5(1 - b)). This again finds the point P' as a weighted mean of triangle vertices V0,
V1, V2 with weights 0.5(a + b), 0.5(1 - a) and 0.5(1 - b) respectively. The weights are
all between 0 and 1 due to the conditions on a and b, and they add to one, so this is
a valid weighted mean. With equal probabilities of selection of non-overlapping
transformations, each with equal contraction ratio of 0.5, triangle V0V1V2 is filled with
uniform point density by the following adapted chaos game.

Given three vertices V0, V1, V2
Set a = 0, b = 0 // arbitrarily chooses V0 as the first point
Loop for i = 1 to a large value
    Select j at random from {0, 1, 2, 3}
    Case
        j = 0: set a = 0.5a, b = 0.5b;
        j = 1: set a = 0.5(1 + a), b = 0.5b;
        j = 2: set a = 0.5a, b = 0.5(1 + b);
        j = 3: set a = 0.5(1 - a), b = 0.5(1 - b);
    End case
    Set point Pi to have barycentric coordinates (a, b)
    Plot point Pi
End loop
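
The adapted chaos game above can be sketched as follows. The names `fill_step` and `fill_triangle`, the seeded generator and the returned list of (a, b) pairs are assumptions of this sketch; converting each pair to a screen position is left to the weighted mean of the vertices given earlier.

```python
import random


def fill_step(a, b, j):
    """Apply one of the four half-scale maps to barycentric coordinates (a, b)."""
    if j == 0:
        return 0.5 * a, 0.5 * b                 # sub-triangle at V0
    if j == 1:
        return 0.5 * (1 + a), 0.5 * b           # sub-triangle at V1
    if j == 2:
        return 0.5 * a, 0.5 * (1 + b)           # sub-triangle at V2
    return 0.5 * (1 - a), 0.5 * (1 - b)         # central triangle W0W1W2


def fill_triangle(n_points, seed=0):
    """Generate barycentric coordinates that fill the whole triangle."""
    rng = random.Random(seed)
    a, b = 0.0, 0.0                              # start at V0
    coords = []
    for _ in range(n_points):
        a, b = fill_step(a, b, rng.randrange(4))
        coords.append((a, b))
    return coords
```

Every generated pair satisfies the conditions on a and b, and the fourth map places points in the central triangle that the three-map game leaves empty.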


Figure 4. Scan line rendering: the internal point P lies on a scan line through Q and R.


In scan line rendering, the colours of interior points of polygons are calculated by linear
interpolation, initially along edges and then along a scan line, of pre-calculated colour (for
Gouraud shading) or surface normals (for Phong shading) at polygon vertices. In Fig. 4,
a scan line cuts the ‘active’ edges V2V0 and V2V1 at Q and R. If I(Vi) is the colour
intensity at Vi, the colour at Q is

I(Q) = (1 - q)I(V2) + qI(V0),

where q = V2Q/V2V0. With r = V2R/V2V1,

I(R) = (1 - r)I(V2) + rI(V1),

and with p = QP/QR,

I(P) = (1 - p)I(Q) + pI(R).

In practice, these formulae are used to pre-calculate offset colour changes and positional
changes of P, Q and R for horizontal and vertical pixel moves.
The method has a slight distorting effect, as the perspective projection of the 3D triangle
is non-affine. Points closer to the observer should display smaller colour change per pixel,
but the method gives a uniform change in the image space, rather than in the world
space. The method is, in any case, a distortion of the correct physical model. The
algorithms  for  Gouraud  and  Phong  shading  allow  updating  of  the  active  edge  list  when 
the  scan  line  passes  a  vertex,  so  ‘spans’  like  QR  can  be  easily  identified  and  updated  for 
polygons  other  than  triangles. 

For  a  triangle  with  an  interior  point  P  known  through  its  barycentric  coordinates  (a, 
b),  colour  can  be  established  directly  from  the  weighted  average  of  the  colour  (or  surface 
normal)  values  at  the  vertices,  without  reference  to  scan  line  directions.  For  Gouraud 
shading, 


I(P) = (1 - a - b)I(V0) + aI(V1) + bI(V2).

For Phong shading, the mean surface normal vector is

n(P) = Normalise[(1 - a - b)n(V0) + an(V1) + bn(V2)],

where n(P) is the surface normal allocated to point P, and Normalise is a function to
reduce a vector to unit length. The argument of a ‘Normalise’ function cannot have zero
length; this would only occur in the case of a degenerate or non-manifold object.

This  gives  an  easy  way  to  fill  triangles  using  IFS  with  barycentric  coordinates, 
which  can  be  shown  to  give  exactly  the  same  results  as  standard  Gouraud  and  Phong 
shading.  The  barycentric  method  is  independent  of  the  scan  line  direction,  and  this 
equivalence  indicates  similar  independence  of  the  traditional  methods  from  the  scan  line 
direction  for  triangle  shading,  which  is  not  the  case  for  polygons  with  more  edges. 
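
The claimed equivalence for triangles can be checked numerically. In the sketch below, a point is specified by the scan line fractions q, r and p defined earlier; substituting Q = (1 - q)V2 + qV0 and R = (1 - r)V2 + rV1 into P = (1 - p)Q + pR gives its barycentric coordinates. The function names are hypothetical and the code is an illustration of the algebra, not the authors' implementation.

```python
def scanline_intensity(q, r, p, i0, i1, i2):
    """Gouraud colour at P by interpolating along edges, then along the span QR."""
    iq = (1 - q) * i2 + q * i0          # colour at Q on edge V2V0
    ir = (1 - r) * i2 + r * i1          # colour at R on edge V2V1
    return (1 - p) * iq + p * ir


def scanline_to_barycentric(q, r, p):
    """Barycentric (a, b) of P = (1 - p)Q + pR, collecting the V1 and V2 weights."""
    a = p * r                                   # weight of V1
    b = (1 - p) * (1 - q) + p * (1 - r)         # weight of V2
    return a, b


def barycentric_intensity(a, b, i0, i1, i2):
    """Gouraud colour at P directly from the vertex colours."""
    return (1 - a - b) * i0 + a * i1 + b * i2
```

Because q and r are arbitrary here, the agreement holds whatever the direction of the span QR, which is the independence from scan line direction noted in the text.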


Figure 5. Extended barycentric coordinates identify an interior point P in a quadrilateral.


4.2  Quadrilateral  rendering 

We can identify an internal point P of a quadrilateral V0V1V2V3 using ‘extended
barycentric coordinates’ (a, b). There is a unique line W0W1 which passes through P so
that the edges V0V1 and V3V2 are divided in equal proportion, a:(1 - a) (Fig. 5). In other
words,


W0 = (1 - a)V0 + aV1,
W1 = (1 - a)V3 + aV2,

where 0 ≤ a ≤ 1. If P lies a proportion b along W0W1,

P = (1 - b)W0 + bW1,

where 0 ≤ b ≤ 1. This effectively maps any interior point of the quadrilateral into a unit
square (Fig. 6), where

V'0 = (0, 0), V'1 = (1, 0), V'2 = (1, 1) and V'3 = (0, 1).


Figure  6.  Mapping  of  the  quadrilateral  of  figure  5  into  a  unit  square 

This is equivalent to the effect of Fig. 3 for a triangle. The meaning of an internal point is
obvious for a planar convex quadrilateral (Fig. 5). For non-planar vertices, the point P
lies on a bi-linear patch, generating a form of hyperbolic paraboloid. For planar concave
quadrilaterals, P may lie outside the quadrilateral boundary, but within the convex hull of the
vertices. Such quadrilaterals can be identified and split into triangles when rendering,
although this allows the ambiguity of two possible interpretations.

Substituting for W0 and W1 into the expression for P gives

P = (1 - b){(1 - a)V0 + aV1} + b{(1 - a)V3 + aV2},

P = (1 - a)(1 - b)V0 + a(1 - b)V1 + abV2 + (1 - a)bV3,

giving P as a weighted mean of the four vertices. This is valid as each of the four
weights (1 - a)(1 - b), a(1 - b), (1 - a)b and ab lies between zero and 1 from conditions
on a and b, and it is easy to establish that their sum is one. This gives a direct method
for proportional shading from vertex colours or normals, which is the same as standard
scan line methods only when W0W1 happens to be aligned with a scan line (Fig. 7). The
IFS method is an improvement on standard methods, in that the shading allocated to a
point is not dependent on quadrilateral orientation.
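
The four weights and the resulting position can be sketched directly. The names `quad_weights` and `quad_point` are hypothetical; the vertex order V0, V1, V2, V3 follows the text, and the code works for 2D or 3D tuples.

```python
def quad_weights(a, b):
    """Weights of V0, V1, V2, V3 for extended barycentric coordinates (a, b)."""
    return ((1 - a) * (1 - b), a * (1 - b), a * b, (1 - a) * b)


def quad_point(a, b, v0, v1, v2, v3):
    """Position P as the weighted mean of the four quadrilateral vertices."""
    w = quad_weights(a, b)
    return tuple(
        w[0] * c0 + w[1] * c1 + w[2] * c2 + w[3] * c3
        for c0, c1, c2, c3 in zip(v0, v1, v2, v3)
    )
```

The corner values (0, 0), (1, 0), (1, 1) and (0, 1) of (a, b) return the four vertices in order, matching the unit square mapping.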

The chaos game applied to the four vertices of a square (Fig. 6) will fill the square.
By selecting V'i to be one of the vertices V'0, V'1, V'2 or V'3 at random, and finding the
mid point of P' and V'i, we have the essential step of the chaos game that will fill the
square V'0V'1V'2V'3. It should be clear that the required mid points have coordinates

• P' and V'0: (0.5a, 0.5b);

• P' and V'1: (0.5(1 + a), 0.5b);

• P' and V'2: (0.5(1 + a), 0.5(1 + b));

• P' and V'3: (0.5a, 0.5(1 + b)).

Applying the usual chaos game directly to the vertices of a general quadrilateral V0V1V2V3
(Fig.  5)  will  only  fill  it  if  it  is  a  parallelogram.  When  the  quadrilateral  is  skew,  a  fractal 
‘Sierpinski  tetrahedron’  will  result9.  If  the  chaos  game  is  applied  to  the  extended 
barycentric  coordinates  of  a  general  quadrilateral  according  to  the  algorithm  specified 
below,  it  fills  all  ‘interior’  points  as  explained  earlier  in  this  section,  not  now  with 
uniform  density,  but  still  giving  weights  that  lead  to  easy  rendering  from  vertex  colours 
or  normals. 

Set (a, b) to (0, 0)
Loop for i = 1 to a large value
    Select j at random from {0, 1, 2, 3}
    Case
        j = 0: set a = 0.5a, b = 0.5b;
        j = 1: set a = 0.5(1 + a), b = 0.5b;
        j = 2: set a = 0.5(1 + a), b = 0.5(1 + b);
        j = 3: set a = 0.5a, b = 0.5(1 + b);
    End case
    Set point Pi to have extended barycentric coordinates (a, b)
    Plot point Pi
End loop

For Gouraud type shading, we have

I(P) = (1 - a)(1 - b)I(V0) + a(1 - b)I(V1) + abI(V2) + (1 - a)bI(V3),

where I(Vi) is the colour allocated to vertex Vi, as above. For Phong type shading,
normals are interpolated by

n(P) = Normalise[(1 - a)(1 - b)n(V0) + a(1 - b)n(V1) +
abn(V2) + (1 - a)bn(V3)].

Shading  depends  only  on  vertex  values,  so  is  independent  of  scan  line  direction,  unlike 
standard  methods.  Symmetry  of  expressions  shows  that  the  order  of  selection  of  vertices 
does  not  affect  shading. 
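
Combining the quadrilateral chaos game with the Gouraud type formula gives a compact shading loop. The following is a sketch under assumptions: the names `quad_step`, `shade` and `shaded_points` are invented here, intensities are single numbers rather than colour triples, and the points are returned instead of plotted.

```python
import random


def quad_step(a, b, j):
    """Move (a, b) halfway towards corner j of the unit square."""
    da = 1 if j in (1, 2) else 0        # corners V'1 and V'2 have a = 1
    db = 1 if j in (2, 3) else 0        # corners V'2 and V'3 have b = 1
    return 0.5 * (da + a), 0.5 * (db + b)


def shade(a, b, i0, i1, i2, i3):
    """Gouraud type intensity from the four vertex intensities."""
    return ((1 - a) * (1 - b) * i0 + a * (1 - b) * i1
            + a * b * i2 + (1 - a) * b * i3)


def shaded_points(intensities, n_points, seed=0):
    """Return (a, b, intensity) samples covering the quadrilateral."""
    rng = random.Random(seed)
    a, b = 0.0, 0.0
    samples = []
    for _ in range(n_points):
        a, b = quad_step(a, b, rng.randrange(4))
        samples.append((a, b, shade(a, b, *intensities)))
    return samples
```

Each sample's position in the scene would follow from the same four weights applied to the vertex positions.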


5  An  example:  Gouraud  shading  of  a  cone 

We  consider  Gouraud  shading  of  a  cone  as  a  simple  example.  If  the  cone  is  represented 
polyhedrally  as  a  pyramid  with  n  sloping  triangular  faces  and  a  regular  n-gon  as  its  base 
(Fig.  7),  there  is  a  problem  of  how  to  allocate  surface  normals  to  the  triangle  vertices 
which  meet  at  V.  If  we  average  normals  for  all  triangles  meeting  at  this  point,  the  applied 
normal  points  directly  out  of  the  vertex  along  the  cone’s  axis,  giving  the  vertex  an 
incorrect  flattened  shading.  Using  the  triangle’s  own  normal  at  V  for  each  triangle  avoids 
this,  but  this  shows  creasing  along  triangle  edges  near  V,  as  there  is  no  continuity  across 
triangle  edges.  The  singular  point  at  V  has  different  normals,  hence  shading  values,  close 
to  V. 
Figure 7. Polygonal approximation of a cone as a pyramid on a regular polygonal base.


Figure  8.  Representation  of  triangle  BCV  as  a  degenerate  quadrilateral. 

If vertex triangles are replaced by quadrilaterals with a zero length edge at V (Fig. 8), the
problem can be overcome. At V1, the surface normal is taken as the normalised mean of
normals of faces VAB and VBC and at V2 the normal is the normalised mean of the
normals of faces VBC and VCD. If V1 and V2 are made coincident at V, this gives
continuity at V, avoiding creasing. The now degenerate trapezium V1BCV2 has the same
shape as the isosceles triangle VBC. There is still a singular point at V, as more than one
normal is allocated to the same point, with a multiplicity of colour values possible at the
vertex on rendering, but this is anti-aliased by the random nature of the IFS. Figure 9
clearly shows creasing from the standard method and its elimination by the degenerate
quadrilateral method, both rendered using an IFS on an eight-sided based pyramid. (This
image was rendered on a 16 bit machine, so some colour banding is seen.)
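
The behaviour at the apex can be illustrated with the extended barycentric weights. In this sketch the degenerate quadrilateral is taken in the order (V1, B, C, V2) with V1 and V2 both placed at the apex; that ordering, the function names and the sample normals in the usage note are assumptions made for illustration only.

```python
import math


def normalise(v):
    """Scale a vector to unit length; a zero vector is rejected."""
    length = math.sqrt(sum(c * c for c in v))
    if length == 0.0:
        raise ValueError("cannot normalise a zero-length vector")
    return tuple(c / length for c in v)


def blend(a, b, q0, q1, q2, q3):
    """Weighted mean of four vertex quantities (positions or normals)."""
    w = ((1 - a) * (1 - b), a * (1 - b), a * b, (1 - a) * b)
    return tuple(sum(wk * qk[i] for wk, qk in zip(w, (q0, q1, q2, q3)))
                 for i in range(len(q0)))


def degenerate_quad_sample(a, b, apex, base_b, base_c, n_apex_1, n_b, n_c, n_apex_2):
    """Position and unit normal on the quadrilateral (V1, B, C, V2) with V1 = V2 = apex."""
    position = blend(a, b, apex, base_b, base_c, apex)
    normal = normalise(blend(a, b, n_apex_1, n_b, n_c, n_apex_2))
    return position, normal
```

With a = 0 every value of b gives the apex position, while the normal sweeps from the value allocated at V1 to the value allocated at V2, which is the multiplicity of shading values at the singular point described above.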
Figure 9. Gouraud shaded cones represented as octagonal based pyramids. The left image shows creasing at
the vertex; the right image, which uses degenerate quadrilaterals, has a smoother representation.


6  Summary 

A  method  for  using  IFS  to  render  non-fractal  forms  in  a  way  similar  to  standard  scan  line 
algorithms  has  been  shown.  Compared  to  scan  line  methods,  IFS  rendering  uses  different 
algorithms  for  triangles  and  quadrilaterals,  not  dealing  with  higher  orders.  The  algorithm 
selects  world  points  at  random,  so  stochastic  anti-aliasing  occurs  by  painting  each  pixel 
with a sample of its possible colours. This is an advantage for still images,
but may give undesirable ‘boiling’ on animation. Standard methods visit each pixel once
per polygon, whereas the IFS method has redundancy, with pixels being visited many times
without a guarantee that all are filled. IFS methods are unlikely to challenge established
methods  on  speed.  However,  programming  them  is  relatively  simple,  with  clipping, 
hidden  surface  and  shadow  problems  reduced  to  simple  decisions. 
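
The paper gives no implementation details for these decisions, so the following is only a generic sketch of how a point-by-point z-buffer test might look. The function name `plot_point`, the nested-list buffers and the convention that smaller depth means nearer to the observer are all assumptions.

```python
def plot_point(zbuffer, image, x, y, depth, colour):
    """Write colour at pixel (x, y) only if the point is visible so far.

    zbuffer and image are lists of rows; zbuffer starts filled with infinity.
    Returns True when the pixel was updated.
    """
    if not (0 <= y < len(zbuffer) and 0 <= x < len(zbuffer[0])):
        return False            # clipping: the point falls outside the image
    if depth >= zbuffer[y][x]:
        return False            # hidden: something nearer is already stored
    zbuffer[y][x] = depth
    image[y][x] = colour
    return True
```

A shadow buffer could be handled in the same style by applying the same depth test from the light source's point of view.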

An example using Gouraud type shading on a basic shape is shown (Fig. 9). There
is much scope for extension. Work on Phong type shading with specular highlights and
on texture mapping, for which the method should be particularly suited, is underway at
the time of writing. As an indication of the scope of the method, two images
incorporating fractal trees (with non-affine transformations), shadows and texturing (for the
trellis and mown grass effect)10 are shown in Fig. 10.


Figure 10. IFS images showing fractal and non-fractal objects using z-buffer and shadow buffer and showing
the potential of the method for texture mapping.
THE α-HULL - THE HULL WHERE FRACTALS LIVE
CALCULATING A HULL BOUNDED BY LOG SPIRALS TO SOLVE
THE INVERSE IFS-PROBLEM BY THE DETECTED ORBITS


ERWIN HOCEVAR

Technical University of Vienna, Institute of Automation 183/2
A-1040 Treitelstr. 3, Vienna, Austria
E-mail: hocevar@zdvaxa.arcs.ac.at

Global IFS seem to be best suited for compressed encoding of natural objects, which are in most cases
self affine even if not always exactly. Since affine transformations (the IFS-Codes), or rather the union of
all their orbits, generate an object (an IFS-Attractor), the detection of a non-minimal set of these orbits
solves the inverse IFS-Problem by calculating a superset of IFS-Codes, which has to be minimized.

Here a method is presented by which these orbits (in particular those on the object boundary) can be
calculated. To this end a generalized convex hull, the α-Hull, is defined. This fractal hull is bounded
by log spirals, the curves formed by the orbits. It is shown that log spirals can be represented by a
continuous function of powers of affine maps and that, by using this "spiral equivalent", the generating
transformations of the orbits by which an IFS-object is bounded can be calculated in the x/y-plane.
Furthermore, this representation can be used for the classification of the detected orbits, which is necessary to
subsequently calculate the IFS-Codes of a minimal IFS from their generating transformations.


1  Introduction 

1.1 Problem Specification - Objects to be encoded

The idea is to globally encode1 objects of an image that appear in nature and are therefore
often self similar to a high degree, but are difficult to encode or to compress by a
mathematical formula. The global IFS-Approach2, which represents an object by the union
of affine contractive transformed copies of the object itself (subobjects generated by the
IFS-Codes), seems to be best suited to solve the inverse (global) IFS-Problem.

Thus the Collage Theorem, which can be used to generate whole objects of an image,
is given: A set $\{\omega_i: R^2 \to R^2 \mid \omega_i \text{ affine, contraction}\}$ of the complete Euclidean space $(R^2, d)$
is an Iterated Function System (IFS) with contractivity factor s, for $i \in N$, the distance
d, $0 \le s < 1$ and $s \cdot h(B, C) \ge h(W(B), W(C))$, if for the Hutchinson operator $W^a$ holds

$$W(A) = \bigcup_{\omega_i \in W} \bigcup_{x \in A} \omega_i(x) \quad \text{and} \quad A = W(A).$$

The compact set A is called an attractor of W.

The Collage Theorem (CT) for attractors close to given compact sets:

$$h(L, W(L)) \le \varepsilon \;\Rightarrow\; h(L, A) \le \varepsilon/(1 - s),$$

where h is called the Hausdorff Distance, $L \subset R^2$ is a given compact set of $(R^2, d)$ and
$\varepsilon > 0$. For $\varepsilon = 0$ (L = A) the CT for exactly
self affine attractors is obtained. In principle the CT can be applied to any object if
sufficiently many IFS-Codes $\omega_i$ are used. However, to compress an object the number of
parts $\omega_i(L)$ (the subobjects) has to be small for a small $\varepsilon$.

a $W: \mathcal{H}(R^2) \to \mathcal{H}(R^2)$ with $W(A) = \{W(a) \mid a \in A\}$, $A \in \mathcal{H}(R^2)$, is a contractive map of $(\mathcal{H}(R^2), h)$, the
space of nonempty compact subsets of $R^2$, where $h(C, B) = \max\{d(C, B), d(B, C)\}$ and $C, B \in \mathcal{H}(R^2)$.
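
For finite point sets, such as pixel sets, the Hausdorff distance and the resulting collage bound can be sketched as follows. The names `directed_distance`, `hausdorff` and `collage_bound`, and the use of the Euclidean metric via `math.dist` (Python 3.8 or later), are assumptions of this sketch.

```python
import math


def directed_distance(c, b):
    """d(C, B): the largest distance from a point of C to its nearest point of B."""
    return max(min(math.dist(x, y) for y in b) for x in c)


def hausdorff(c, b):
    """h(C, B) = max{d(C, B), d(B, C)} for two finite, non-empty point sets."""
    return max(directed_distance(c, b), directed_distance(b, c))


def collage_bound(eps, s):
    """Upper bound eps / (1 - s) on h(L, A) given h(L, W(L)) <= eps."""
    if not 0 <= s < 1:
        raise ValueError("contractivity factor must satisfy 0 <= s < 1")
    return eps / (1 - s)
```

The bound shows why a small contractivity factor matters: with s = 0.5 a collage error of 0.1 guarantees an attractor within 0.2 of the target, whereas s close to 1 makes the guarantee very weak.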

For the discrete pixel space the discrete CT is defined as follows: a set $\{\omega_m: P^2 \to P^2 \mid \omega_m$
affine, contraction$\}$ of the pixel space $(P^2, d)$, with $P = \{0, \ldots, R-1\} \subset N$, is a Discrete Iterated
Function System (DIFS) with contractivity factor s, for $m \in P$, $R^2$ the number of image
screen pixels, the block distance d, $0 \le s < 1$ and $s \cdot h(B, C) \ge h(W(B), W(C))$, if for the
Hutchinson operator $W: \mathcal{H}(P^2) \to \mathcal{H}(P^2)$ holds

$$W(A) = \bigcup_{m \in P} \bigcup_{x \in A} \omega_m(x) \quad \text{and} \quad A = W(A).$$

The set A is called a discrete attractor of W. The discrete attractor $A \in \mathcal{H}(P^2)$ is not
unique, but there always exists a unique maximal attractor $A_{max} \in \mathcal{H}(P^2)$ with $A = W(A) \Rightarrow
A \subset A_{max}$, which contains all discrete attractors.

The Chaos Game generates an attractor A by the iterated random application of the
different $\omega_m$ to one image point, i.e., by the union of orbits formed by all combinations of
$\omega_m$ - the basic principle of the presented solution.

In  contrast  to  global  IFS,  an  encoding  based  on  Partitioned  IFS  (PIFS)4  divides  the 
image  into  larger  and  smaller  segments  (polygons).  The  smaller  target  segments  are 
interpreted  as  affine  contractive  copies  of  the  larger  ones  and  then  the  CT  is  applied. 
Since  only  IFS-Properties  of  affine  relationships  constrained  to  the  segment  form  can  be 
exploited,  all  of  the  many  image  parts  are  only  approximated  by  an  IFS-encoding. 

1.2  The  Solution  -  How  to  encode  global  IFS-Objects 
Calculating  boundaries  of  an  object 

In general the boundary of an object contains the most information about how the object is
built. Points inside an object normally cannot be related, because they are difficult to
distinguish. The common method to determine the boundary of an object - especially if it
is defined in a discrete (pixel) space - is to form the hull of the object.

We can first try the convex hull $H(A)$, which is defined as the "intersection of half
planes containing a point set (the attractor) A". It soon appears that a hull of this kind
is not usable for IFS-Attractors, because only the boundary of convex objects - which an
IFS-Attractor certainly is not - can be determined.

Second, we can try a more general hull - the α-Hull5 $H_\alpha$ - defined as follows:

The α-Hull $H_\alpha(A)$ (see Fig. 1) is defined as the intersection of all generalized disks $B(c, r)$
resp. $B(c, -r)$ of non-negative radius $r = 1/\alpha$ and centre $c$ containing the attractor $A$,
where $B(c, -r)$ is the complement of the disk $B(c, r)$.6

The boundary of an α-Hull is formed by α-Extremes (see Fig. 1):

A point $s \in A$ is termed α-Extreme (corner of $H_\alpha(A)$) in $A$ if there exists a $B(c, 1/\alpha)$
such that $s$ lies on its boundary and it contains all other points of $A$.6

Now it is possible to analyse concave objects, and even objects that are concave with
different curvature, by using different radii of the generalized disks. But we need a
relationship of boundary points (extremes) which provides more information about the
object. Using the α-Hull we only get a relationship between points lying on the same edge
of a disk, of different radius, forming the α-Hull. But the boundary points of an
IFS-Attractor do not lie on circles. They form orbits^b in the discrete space, which are
given as a subset of $W^+$ ^c for an IFS $W$ (as is easily observed in the application of the
Chaos Game). These orbits lie on logarithmic spirals1, which will be shown in this paper.
So the best suited hull to determine an IFS-Attractor seems to be the σ-Hull, which is
bounded by log spirals and will be defined and described in detail in the main part of
this paper.

Calculating  a  minimal  IFS 

Once the orbits on the boundaries of different σ-Hulls have been found, they have to be
classified to calculate the IFS-Codes:

Classes $C_i$ = {orbits which can be affinely mapped onto each other} are formed.

The potential IFS-Codes are the generating transformations of the largest
orbits^d of each class.

Then the final IFS-Codes $\omega_j$ are the generating transformations of the largest orbits
within each $C_i$ which can be mapped onto each other by (have an inter orbit
transformation in) $\{\omega_j\}^*$.

Thus the final and also minimal IFS can be calculated by the following iterative process,

(a) which restrains the classes to those orbits which can be mapped onto each other
only by the $\omega_j$ detected so far

=> more and more classes are generated (and the set $\{\omega_j\}$ is increased)

(b) which enlarges the classes to those orbits which can now be mapped onto each
other only by the $\omega_j$ detected so far

=> fewer and fewer classes are generated (and the set $\{\omega_j\}$ is decreased)

This process is repeated until no more (in (a)) resp. no fewer (in (b)) classes are
generated. Then the generating transformations of the largest orbits of each remaining
class form a minimal set of the IFS-Codes of the object.

Since a σ-Hull is only useful to find the orbits on the boundary of an object, orbits of
inner subobjects (having no boundary in common with the object) cannot be found in the
first step. But if one forms the difference between the object $A_0$ and the point set
generated by the IFS-Codes in the first step, one gets a difference object $A_1$ in the second
step. Those affine transformations mapping boundary orbits (detected by different
σ-Hulls) of the object $A_0$ to those of the difference object $A_1$ are the IFS-Codes for the inner
subobjects now lying outside. The second step has to be repeated for the object $A_i$ ($i \in \mathbb{N}$)
as long as the difference object $A_{i+1}$ is not empty. By adjusting the obtained results and
eliminating possible deviations by comparing the inverse IFS-mapping of the hull of
inner subobjects with the object hull, the inverse IFS-Problem for overlapping
subobjects can also be solved.


b  An orbit $O$ is a point sequence $o_n = \phi^{(n)}(x)$ $\forall n \in \mathbb{N}$, where $\phi^{(n)} = \prod_n \phi$, $\phi$ affine and $x \in \mathbb{R}^2$.
The point sequence (orbit) $\phi^{(n)}(p)$ converges for a contractive affine map $\phi$ to the fixed point $f$ of $\phi$.

c  $W^* = \{\prod_{n \in \mathbb{N}} \omega_n \mid \omega_n \in W \cup \{\varepsilon\}\}$ and $W^+ = W^* \setminus \{\varepsilon\}$ with $W = \{\omega_i \mid \omega_i \text{ affine}\}$ and $\varepsilon$ the Identity Map.

d  The largest orbits of a subobject $\rho(A)$ ($\rho \in W \cup \{\varepsilon\}$) are those orbits $O = \{o_0, \ldots, o_n\}$ with a
minimal contraction, where $(o_0) \in A$ for $n > 0$ and $(o_0) \in A$ for $n < 0$ and $\phi \in W^+$.

2  Definition and Application of the σ-Hull

2.1  The σ-Hull (fractal hull) in $\mathbb{R} \times \varphi$


Figure 1. The convex Hull, the α- and σ-Hull of an Object


We first introduce the notion of generalized spiral disks. Let $D^*(f, r)$ denote a spiral disk
defined by the point set $\bigcup_{i}^{\infty} p_i$ with fixed point (centre) $f$ and the non-negative radius
$r = Re^{k\varphi}$, where $p_i \in \mathbb{R} \times \varphi$, $d(f, p_i) < r$ and $0 < \varphi < 2m\pi$, $m \in \mathbb{N}$.

A generalized spiral disk is then given by

$$D'(f, r) \equiv \begin{cases} D^*(f, -r)^c & r < 0 \\ D^*(f, r) & r \ge 0 \end{cases} \qquad (1)$$

where $X^c$ is the complement of $X$.

Since $D'(f, r)$ is not solely bounded by a log spiral (but also by a part of the $\varphi = 0$ axis),
a generalized spiral disk segment has to be constructed that is bounded only by equally
curved log spirals having the same fixed point.

Thus a generalized spiral disk segment is given by (Fig. 1):

$$D(f, r) \equiv D'(f, Re^{k\varphi}) \cap D'(f, \ldots) \cap D'(f, \ldots) \qquad (2)$$

Note: For negative $k$ one will get a spiral disk segment reflected by the $\varphi = 0$ axis!

$C_\sigma(S)$ is the set of fixed points of spiral disk segments of radius $1/\sigma$ that have $S$ as subset:

$$C_\sigma(S) \equiv \{x \in \mathbb{R} \times \varphi \mid S \subset \overline{D(x, 1/\sigma)}\} \qquad (3)$$

where $\overline{X}$ is the set closure of $X$.

Let $X$ be an arbitrary (finite or connected) set in $\mathbb{R} \times \varphi$; then the intersection of all
generalized spiral disk segments with varying fixed point $x \in X$ and a fixed radius $1/\sigma$
is denoted by

$$M_\sigma(X) \equiv \begin{cases} \mathbb{R} \times \varphi & X = \emptyset \\ \bigcap_{x \in X} D(x, 1/\sigma) & X \ne \emptyset \end{cases} \qquad (4)$$

where $\emptyset$ is the empty set.

The σ-Hull of $S$ is the intersection of all generalized spiral disk segments of radius $1/\sigma$
that contain all points of $S$. Thus the σ-Hull (fractal hull) is given by

$$H_\sigma(S) \equiv M_\sigma(C_\sigma(S)) \qquad (5)$$

A point $s \in S$ is termed σ-Extreme (corner of $H_\sigma(S)$) in $S$ if there exists a $D(c, 1/\sigma)$
such that $s$ lies on its boundary and it contains all other points of $S$ (see Fig. 1).

Thus the set of σ-Extremes is defined by

$$E_\sigma(S) \equiv \{s \in S \mid \exists c \in C_\sigma(S): s \in \partial D(c, 1/\sigma)\} \qquad (6)$$

where $\partial X$ is the boundary of $X$.

As in the limit case of infinite radius (i.e. $\sigma$ close to 0) resp. of radius $= R$ ($k = 0$) the
generalized spiral disk segment becomes a half plane resp. a circle of radius $R$, this
definition includes the definition of the convex Hull $H(S)$ resp. of the α-Hull $H_\alpha(S)$ as well.


2.2  Defining the σ-Hull in $\mathbb{R}^2$

To represent a log spiral $r = Re^{k\varphi}$ defined in the polar plane ($\mathbb{R} \times \varphi$), an equivalent for the
x/y-plane ($\mathbb{R} \times \mathbb{R}$) has to be found. The best suited equivalent seems to be one formulated
in terms of affine transformations. In addition, this representation will relate log spirals
and orbits (lying on the log spirals) whose union forms an IFS and where the generating
transformations of the largest of these orbits are the IFS-Codes.

Theorem 1 - Log spiral equivalent - A log spiral $r = Re^{k\varphi}$ can be represented by a
continuous function of powers of an affine transformation $\omega$.

$$Re^{k\varphi} \equiv \omega^n = R_0(\alpha)\, S(c_x^n, c_y^n)\, R_0(-\alpha)\, R_0(n(-\beta)),$$
where $n \in \mathbb{R}$, $\alpha \in [-\pi, \pi)$, $\beta \in [-\pi, \pi)$ and $\beta \ne 0$

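Lemma 1 Each affine transformation can be represented by a symmetric matrix and a
rotation matrix.

$$\forall \omega = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in \mathbb{R}^{2\times 2} \;\; \exists \beta \in [-\pi, \pi) \;\; \exists \psi = \begin{pmatrix} r & s \\ s & t \end{pmatrix} \in \mathbb{R}^{2\times 2}:$$

$$\omega = \psi R_0(-\beta) = \begin{pmatrix} r\cos\beta + s\sin\beta & s\cos\beta - r\sin\beta \\ s\cos\beta + t\sin\beta & t\cos\beta - s\sin\beta \end{pmatrix} \quad\text{with}\quad \tan\beta = \frac{c-b}{a+d}$$

$$r = \frac{a^2 + b^2 + ad - bc}{\sqrt{(a+d)^2 + (c-b)^2}}, \quad s = \frac{ac + bd}{\sqrt{(a+d)^2 + (c-b)^2}} \quad\text{and}\quad t = \frac{c^2 + d^2 + ad - bc}{\sqrt{(a+d)^2 + (c-b)^2}}$$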


Proof  of  Lemma  1 :  in 
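The decomposition of Lemma 1 can be checked numerically. The sketch below is only an illustration under an assumed sign convention: `rot0(theta)` is taken to be $[[\cos\theta, \sin\theta], [-\sin\theta, \cos\theta]]$, chosen so that $R_0(-\beta)$ reproduces the expanded matrix of the lemma. The helper names and the example matrix are hypothetical.

```python
import math

def rot0(theta):
    """R_0(theta) under the sign convention assumed here: [[cos, sin], [-sin, cos]]."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, s), (-s, c))

def matmul(A, B):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def split_rotation(omega):
    """Return (psi, beta) with omega = psi * R_0(-beta) and psi symmetric."""
    (a, b), (c, d) = omega
    beta = math.atan2(c - b, a + d)      # tan(beta) = (c - b) / (a + d)
    psi = matmul(omega, rot0(beta))      # R_0(beta) undoes R_0(-beta)
    return psi, beta

omega = ((0.6, -0.3), (0.2, 0.5))        # hypothetical example map
psi, beta = split_rotation(omega)
rebuilt = matmul(psi, rot0(-beta))
print(psi, beta)
```

Using `atan2` instead of a plain arctangent keeps the sketch well defined when $a + d = 0$ and picks the branch for which $r + t \ge 0$.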


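Lemma 2 Each symmetric matrix can be represented by a rotation $\alpha$, a scaling matrix
and a rotation $-\alpha$.

$$\forall \psi = \begin{pmatrix} r & s \\ s & t \end{pmatrix} \in \mathbb{R}^{2\times 2} \;\; \exists \alpha \in [-\pi, \pi) \;\; \exists c_x, c_y \in \mathbb{R}: \quad \psi = R_0(\alpha)\, S(c_x, c_y)\, R_0(-\alpha)$$

with the eigenvalues

$$c_x = \frac{r + t + \sqrt{(r-t)^2 + 4s^2}}{2}, \qquad c_y = \frac{r + t - \sqrt{(r-t)^2 + 4s^2}}{2}$$

and $\alpha = \arctan(\ldots)$ the angle of the eigenvector of $c_x$.


Proof of Lemma 2: in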
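Lemma 2 can likewise be illustrated with a short numerical sketch. It assumes the same hypothetical rotation convention `rot0(theta)` $= [[\cos\theta, \sin\theta], [-\sin\theta, \cos\theta]]$ as an illustration only, and computes $\alpha$ from the standard principal-axis angle of a symmetric 2x2 matrix rather than from the formula of the lemma.

```python
import math

def rot0(theta):
    """R_0(theta) under the sign convention assumed here: [[cos, sin], [-sin, cos]]."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, s), (-s, c))

def matmul(A, B):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def diagonalize(psi):
    """Return (alpha, cx, cy) with psi = R_0(alpha) S(cx, cy) R_0(-alpha)."""
    (r, s), (_, t) = psi
    root = math.hypot(r - t, 2 * s)            # sqrt((r - t)^2 + 4 s^2)
    cx = (r + t + root) / 2                    # larger eigenvalue
    cy = (r + t - root) / 2                    # smaller eigenvalue
    alpha = -0.5 * math.atan2(2 * s, r - t)    # sign follows the assumed rot0 convention
    return alpha, cx, cy

psi = ((0.7, 0.1), (0.1, 0.4))                 # hypothetical symmetric matrix
alpha, cx, cy = diagonalize(psi)
rebuilt = matmul(matmul(rot0(alpha), ((cx, 0.0), (0.0, cy))), rot0(-alpha))
print(alpha, cx, cy)
```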


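Lemma 3 Each affine transformation $\omega$ maps a point $x \in Re^{k\varphi}$ again to a point $x_1 \in Re^{k\varphi}$.

$$\forall \omega \in \mathbb{R}^{2\times 2} \text{ and } x \in Re^{k\varphi}: \quad \omega x = x_1 \text{ and } x_1 \in Re^{k\varphi}$$

Proof of Lemma 3:

$$r = Re^{k\varphi} \quad\text{resp.}\quad \sqrt{x^2 + y^2} = Re^{k \arctan\frac{y}{x}}$$

$$x = r\cos\varphi = Re^{k\varphi}\cos\varphi \quad\text{and}\quad y = r\sin\varphi = Re^{k\varphi}\sin\varphi$$

$$\omega x = \omega \begin{pmatrix} x \\ y \end{pmatrix} = \omega \begin{pmatrix} Re^{k\varphi}\cos\varphi \\ Re^{k\varphi}\sin\varphi \end{pmatrix} = Re^{k\varphi} \begin{pmatrix} a\cos\varphi + b\sin\varphi \\ c\cos\varphi + d\sin\varphi \end{pmatrix} \;\Rightarrow$$

$$\varphi = \arctan\frac{y}{x} = \arctan\frac{c\cos\varphi + d\sin\varphi}{a\cos\varphi + b\sin\varphi} \quad\text{and}$$

$$r = \sqrt{x^2 + y^2} = Re^{k\varphi}\sqrt{(a\cos\varphi + b\sin\varphi)^2 + (c\cos\varphi + d\sin\varphi)^2}$$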

.  I - - - T  ,  . 

Rek(p-y(acos(p4-bsin9)2 +(ccos^+dsin9)2  -  Rek{pe  acos^bsm?.  :ektp 


R2(acos<p+bsin<p)2  +  R2(ccos<p+dsinp)2  =  Re 


k  aictan  c  cos  ^  sin  <p 


x2  y2  arctan—  =  arctan— 

Rx  x 

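Lemma 4 A power $n \in G$ ($G$ the set of integers) of an affine transformation maps $x \in Re^{k\varphi}$ again to $x_1 \in Re^{k\varphi}$.

$$\forall \omega^n \in \mathbb{R}^{2\times 2} \text{ and } x \in Re^{k\varphi}: \quad \omega^n x = x_1 \text{ and } x_1 \in Re^{k\varphi} \text{ where } n \in G$$

Proof of Lemma 4:

Lemma 3 holds for each point on the log spiral $Re^{k\varphi}$,
i.e. also for $\omega x$ and for $\omega^2 x$ and so on until $\omega^n x$, resp. also for $\omega^{-1}, \omega^{-2}, \ldots, \omega^{-n}$.

Corollary 1 Each power of an affine transformation can be represented as follows.

$$\omega^n = \begin{pmatrix} a & b \\ c & d \end{pmatrix}^n \equiv R_0(\alpha)\, S(c_x^n, c_y^n)\, R_0(-\alpha)\, R_0(n(-\beta)) = \begin{pmatrix} c_x^n\cos^2\alpha + c_y^n\sin^2\alpha & (c_x^n - c_y^n)\cos\alpha\sin\alpha \\ (c_x^n - c_y^n)\cos\alpha\sin\alpha & c_y^n\cos^2\alpha + c_x^n\sin^2\alpha \end{pmatrix} R_0(n(-\beta))$$

Proof of Corollary 1:

Because of Lemma 4, Lemma 1 can also be defined for $n \in G$:

$$\omega^n = \psi^n R_0(-\beta)^n \quad\text{where}\quad \psi = \begin{pmatrix} r & s \\ s & t \end{pmatrix}$$

$$\Rightarrow\; \omega^n = (\psi R_0(-\beta))^n = \psi^n R_0(-\beta)^n = \psi^n R_0(n(-\beta))$$
$$= R_0(\alpha) S(c_x, c_y) R_0(-\alpha) R_0(\alpha) S(c_x, c_y) R_0(-\alpha) R_0(\alpha) \cdots S(c_x, c_y) R_0(-\alpha) R_0(n(-\beta))$$
$$= R_0(\alpha) S(c_x, c_y) R_0(0) S(c_x, c_y) R_0(0) \cdots S(c_x, c_y) R_0(-\alpha) R_0(n(-\beta))$$
$$= R_0(\alpha) S(c_x, c_y)^n R_0(-\alpha) R_0(n(-\beta)) = R_0(\alpha) S(c_x^n, c_y^n) R_0(-\alpha) R_0(n(-\beta))$$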
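The representation of Corollary 1 can be evaluated for real exponents $n$ as sketched below. The code is an illustration only: it assumes positive $c_x, c_y$, the hypothetical rotation convention `rot0(theta)` $= [[\cos\theta, \sin\theta], [-\sin\theta, \cos\theta]]$, and example parameters that do not come from the paper. The check is restricted to a similitude ($c_x = c_y = c$), where the formula coincides with the ordinary matrix power and the orbit of the point $(1, 0)$ lies on the log spiral $r = e^{k\varphi}$ with $k = \ln c / \beta$. For $c_x \ne c_y$ the symmetric and the rotation factor generally do not commute, so the function should then be read as evaluating the stated representation rather than a general matrix power.

```python
import math

def rot0(theta):
    """R_0(theta) under the sign convention assumed here: [[cos, sin], [-sin, cos]]."""
    c, s = math.cos(theta), math.sin(theta)
    return ((c, s), (-s, c))

def matmul(A, B):
    """Product of two 2x2 matrices given as nested tuples."""
    return tuple(
        tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )

def omega_power(alpha, cx, cy, beta, n):
    """Evaluate R_0(alpha) S(cx^n, cy^n) R_0(-alpha) R_0(n(-beta)) for real n (cx, cy > 0)."""
    scale = ((cx ** n, 0.0), (0.0, cy ** n))
    psi_n = matmul(matmul(rot0(alpha), scale), rot0(-alpha))
    return matmul(psi_n, rot0(-n * beta))

def apply(M, p):
    """Apply a 2x2 matrix to a point."""
    return (M[0][0] * p[0] + M[0][1] * p[1], M[1][0] * p[0] + M[1][1] * p[1])

# Hypothetical similitude: cx = cy = c, so the orbit of (1, 0) is a log spiral.
c, alpha, beta = 0.8, 0.3, 0.5
k = math.log(c) / beta
orbit = [apply(omega_power(alpha, c, c, beta, n), (1.0, 0.0)) for n in (0, 0.5, 1, 2, 3.5)]
w1 = omega_power(alpha, c, c, beta, 1)
w2 = omega_power(alpha, c, c, beta, 2)
print(orbit)
```

Integer values of $n$ give the discrete orbit points, while the non-integer values fill in the spiral between them.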

Corollary 2 Lemma 4 can also be defined for powers $n \in \mathbb{R}$ instead of $n \in G$.

$$\forall \omega^n \in \mathbb{R}^{2\times 2} \text{ and } x \in Re^{k\varphi}: \quad \omega^n x = x_1 \text{ and } x_1 \in Re^{k\varphi} \text{ where } n \in \mathbb{R}$$

Proof of Corollary 2:

Lemma 4 also holds for $n \in \mathbb{R}$ if $\omega^n$ is represented according to Corollary 1, since powers
$n \in \mathbb{R}$ not applicable to $\omega$ are now defined for $c_x$ and $c_y$ (i.e. $n$ is no longer restricted to $G$).

Summary (Proof of Theorem 1)

Lemma 4 - using Lemma 3 - shows that $\omega^n$ ($n \in G$) generates point sequences on log
spirals (the so-called orbits). Corollary 2 points out that for $n \in \mathbb{R}$ the orbits of $\omega^n$ form
log spirals. Corollary 1 defines - by means of Lemma 1 and Lemma 2 - a representation
$\omega^n = R_0(\alpha) S(c_x^n, c_y^n) R_0(-\alpha) R_0(n(-\beta))$ which can be used to calculate $\omega^n$ in $\mathbb{R} \times \mathbb{R}$ for
$n \in \mathbb{R}$, which concludes the chain of the proof for Theorem 1.


Definition of the σ-Hull in $\mathbb{R}^2$

Let $D^*(f, \omega^n)$ be a spiral disk $\bigcup_{i}^{\infty} p_i$ with fixed point $f$, bounded by $\omega^n$ with $0 < n < 2m\pi/\beta$,
$m \in \mathbb{N}$, and radius $|\omega^n(s-f)|$, where $p_i \in \mathbb{R} \times \mathbb{R}$ and $d(f, p_i) < |\omega^n(s-f)|$, $s \in \mathbb{R} \times \mathbb{R}$.

Then

$$D'(f, \omega^n) \equiv \begin{cases} D^*(f, \omega^{-n})^c & n < 0 \\ D^*(f, \omega^n) & n \ge 0 \end{cases} \qquad (7)$$

denotes a generalized spiral disk in $\mathbb{R}^2$.

The generalized spiral disk segments described by means of log spirals in $\mathbb{R} \times \varphi$ as in (2)
can now be defined using a continuous function of powers of affine transformations in $\mathbb{R}^2$:

$$D(f, \omega^n) \equiv D'(f, \omega^n) \cap D'(f, R_0(\pi/4)\,\omega^{-n}) \cap D'(f, R_0(\pi/2)\, S(-1,1)\,\omega^n) \qquad (8)$$

where $\omega^n = R_0(\alpha) S(c_x^n, c_y^n) R_0(-\alpha) R_0(n(-\beta))$ and $0 < c_x < 1$, $0 < c_y < 1$, $-\pi \le \alpha < \pi$, $-\pi \le \beta < \pi$.
Note: for negative $\alpha$ and $\beta$ one will get a spiral disk segment reflected by the x-axis.

This - by adapting the definition (3) of the set of fixed points $C_\sigma(S)$ to $\mathbb{R} \times \mathbb{R}$ - leads to
the definition of the intersection of generalized spiral disk segments $M_\sigma(X)$ resp. the
σ-Hull in $\mathbb{R} \times \mathbb{R}$ instead of $\mathbb{R} \times \varphi$ in (4) resp. (5):

$$M_\sigma(X) \equiv \begin{cases} \mathbb{R} \times \mathbb{R} & X = \emptyset \\ \bigcap_{x \in X} D(x, 1/|\omega^n(s-f)|) & X \ne \emptyset \end{cases} \quad\text{resp.}\quad H_\sigma(S) \equiv M_\sigma(C_\sigma(S)) \qquad (9)$$

This leads to additional special cases for the σ-Hull defined in $\mathbb{R}^2$:

If $c_x = c_y$ in $R_0(\alpha) S(c_x^n, c_y^n) R_0(-\alpha) R_0(n(-\beta))$

=> $D(f, \omega^n)$ is bounded by 2 straight lines forming an angle of

If $c_x \ne c_y$ and $\alpha = 0$ in $R_0(\alpha) S(c_x^n, c_y^n) R_0(-\alpha) R_0(n(-\beta))$

=> $D(f, \omega^n)$ is bounded by 3 exponential parabolas.

Both shapes are necessary to find all orbits (also those lying on non-spiral-like curves)
by which an IFS is bound.


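2.3  Calculating Orbits using the σ-Hull in $\mathbb{R}^2$
Calculation of Orbit Generating Transformations

Lemma 5 The curvature proportion $\rho$ of $\omega^n \equiv Re^{k\varphi}$ is defined by a $k$ independent of $\beta$.

$$\forall \omega^n \equiv Re^{k\varphi} \text{ with } \rho = R/r: \quad k = \frac{\ln(c_x^2\cos^2\alpha + c_y^2\sin^2\alpha)}{2\arctan\dfrac{(c_x - c_y)\sin\alpha\cos\alpha}{c_x\cos^2\alpha + c_y\sin^2\alpha}}$$

Proof of Lemma 5 (cf. Proof of Lemma 3)

Using

$$\psi^n = \begin{pmatrix} r & s \\ s & t \end{pmatrix} = \begin{pmatrix} c_x^n\cos^2\alpha + c_y^n\sin^2\alpha & (c_x^n - c_y^n)\cos\alpha\sin\alpha \\ (c_x^n - c_y^n)\cos\alpha\sin\alpha & c_y^n\cos^2\alpha + c_x^n\sin^2\alpha \end{pmatrix}$$

of Corollary 1 for the substitution of $x$ and $y$ resp. $r$ and $\varphi$ in the definition of log spirals,

$$R\sqrt{(a\cos\varphi + b\sin\varphi)^2 + (c\cos\varphi + d\sin\varphi)^2} = R\,e^{k\arctan\frac{c\cos\varphi + d\sin\varphi}{a\cos\varphi + b\sin\varphi}}$$

$$\sqrt{(c_x\cos^2\alpha + c_y\sin^2\alpha)^2 + ((c_x - c_y)\sin\alpha\cos\alpha)^2} = e^{k\arctan\frac{(c_x - c_y)\sin\alpha\cos\alpha}{c_x\cos^2\alpha + c_y\sin^2\alpha}} \quad\text{for } \varphi = 0$$

leads to

$$2k\arctan\frac{(c_x - c_y)\sin\alpha\cos\alpha}{c_x\cos^2\alpha + c_y\sin^2\alpha} = \ln(c_x^2\cos^2\alpha + c_y^2\sin^2\alpha)$$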


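Since for fixed $\alpha$ (e.g. $\tan\alpha = 1$) different values of $c_x$ and $c_y$ form log spirals $Re^{k\varphi}$ with
arbitrary $k$ (different in $\beta$), $k$ resp. $1/k$ can be represented in dependency of $c_x$ and $c_y$:

$$k = \frac{\ln\dfrac{c_x^2 + c_y^2}{2}}{2\arctan\dfrac{c_x - c_y}{c_x + c_y}} \quad\text{resp.}\quad \frac{1}{k} = \frac{\ln\dfrac{c_x + c_y}{2}}{2\arctan\dfrac{\sqrt{c_x} - \sqrt{c_y}}{\sqrt{c_x} + \sqrt{c_y}}}$$

(for $1/k$, $c_x, c_y$ turn into $\sqrt{c_x}$ and $\sqrt{c_y}$!)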

Now  (A)  the  nominator  resp.  (B)  the  denominator  of  the  above  equations  for  k  and  1/k 

can  be  equated  to  — —  ■  resp.  — — to  calculate  cv  and  cv. 

2(k+l)  F  2(k  +  l)  x  y 


-ktt  -k"*^ 

(A)  =>  c2x+c2y  =  2e2(k+1)  for  k  and  cx+cy  =  2e2(k  +1)  for  k"1 

(B)  =>  cx-c  =  -(c+cy)tan— -----  for  k  and  - JcT  =  + Jc7)tan - j —  for  k'1 

This leads to:

Note: Now it is obvious why $k$ and $1/k$ are treated differently, because otherwise at least one
of the resulting $c_x$ or $c_y$ can become greater than one in some cases.

The representation of an orbit $\omega^n$ by a continuous function of $n$ (Theorem 1) allows one to
form the σ-Hull for an attractor $A$ parametrized only by $c_x$ and $c_y$ and a fixed $\alpha$ (because
in that case an arbitrary $\beta$ leads to equally curved σ-Hulls according to Lemma 5). That is,
to find an orbit generating transformation $\omega$, where a maximal number of discrete boundary
points of $A$ lie on the orbit $\omega^n$ (for $n \in G$) forming the edge of a σ-Hull (for $n \in \mathbb{R}$).

Now generating transformations $\omega_s$ of orbits can be calculated using the equivalent
representation of a σ-Hull in $\mathbb{R}^2$ if a σ-Hull is found where at least four boundary points
$p_1, p_2, p_3, p_4$ of $A$ lie on one of the circumscribed spiral segments $S$ with the fixed point
$f$, forming the edge of this (best fitting) hull.

Thus, $\forall i \in \{1,2,3,4\}$: $p_i \in \omega^n \,\vee\, p_i \in R_0(\pi/4)\,\omega^{-n} \,\vee\, p_i \in R_0(\pi/2)\, S(-1,1)\,\omega^n$ and $\omega^n$
forms the edge of the σ-Hull.

The $c_x$, $c_y$, $\alpha$ and $\beta$ - for the log spiral equivalent (Theorem 1) - can be calculated from
$\omega_s$ by using the representation $\omega_s = \psi R_0(-\beta)$ and $\psi = R_0(\alpha) S(c_x, c_y) R_0(-\alpha)$ (Lemma 1 and 2).


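Optimizing the orbit generating transformations $\omega_s$

$\omega_s = R_0(\alpha) S(c_x^n, c_y^n) R_0(-\alpha) R_0(n(-\beta))$ can now be optimized in the following way:

Since $\beta$ can also be calculated as $\beta_k = \angle(p_k\, f\, R_0(\alpha_k) S(c_x^{-1}, c_y^{-1}) R_0(-\alpha_k)(p_{k+1}))$ and $k = 1,2,3$
=> a new $\beta = (\beta_1 + \beta_2 + \beta_3)/3$ can be obtained.
Now a new $c_x$, $c_y$ and $\alpha$ are recalculated using $\beta_k$ to compute $\psi_k$:

=> $c_x = (c_{1x} + c_{2x} + c_{3x})/3$, $c_y = (c_{1y} + c_{2y} + c_{3y})/3$ and $\alpha = (\alpha_1 + \alpha_2 + \alpha_3)/3$

where $c_{kx}$, $c_{ky}$ and $\alpha_k = \arctan(\ldots)$ for

$$\psi_k = \omega_s R_0(\beta_k) = \begin{pmatrix} r_k & s_k \\ s_k & t_k \end{pmatrix} \equiv R_0(\alpha_k)\, S(c_{kx}, c_{ky})\, R_0(-\alpha_k)$$

are as in Lemma 2.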


Reflection transformations (IFS-Codes) - Either $c_x$ or $c_y$ is negative
If two best fitting σ-Hulls are found which are formed by the log spirals
$\omega_1^{2n}(p_1) = R_0(\alpha) S(c_x^{2n}, c_y^{2n}) R_0(-\alpha) R_0(2n(-\beta))$ or
$\omega_2^{2n+1}(p'_1) = R_0(\alpha) S(c_x^{2n+1}, c_y^{2n+1}) R_0(-\alpha) R_0(-\beta(2n+1))$
for $n \in \mathbb{N}_0$ resp. $1/n \in \mathbb{N}$ and $p_1, p'_1 \in \mathbb{R}^2$, one affine reflective map can be derived:

$\omega_{12} = R_0(\alpha) S(-c_x, c_y) R_0(-\alpha) R_0(-\beta)$ resp. $\omega_{12} = R_0(\alpha) S(c_x, -c_y) R_0(-\alpha) R_0(-\beta)$,
where $c_x < 0$ resp. $c_y < 0$, if $R_0(\gamma)\,\omega_1^n(p_1) = \omega_2^n(p'_1)$ resp. $R_0(-\gamma)\,\omega_1^n(p_1) = \omega_2^n(p'_1)$ for $\gamma > 0$.

2.4  How to calculate IFS-Codes
Deriving the IFS-Codes from boundary orbits

Form larger and larger curved σ-Hulls, where the $c_x$ and $c_y$ are computed according to
Lemma 5 by using smaller and smaller $k$ resp. $1/k$ (for $Re^{k\varphi}$ in the log spiral equivalent
of Theorem 1) for a fixed $\alpha$ (Fig. 2).

Each time a best fitting hull is obtained (at least four boundary points of an attractor $A$ lie
on the edge of the σ-Hull), the $\omega_s$ - generating the orbit defined by these 4 points - is one
IFS-Code of the IFS $W_m$ for a subobject lying outside (i.e. having a part of its boundary
in common with the object) as long as $\omega_s \ne \omega_{s'}$ (Fig. 2). I.e. $W_{i+1} = W_i \cup \{\omega_s\}$ for $i =
\{1, \ldots, m\}$, if $\omega_s \ne \omega_{s'}$ where $\omega_{s'} \in W_i$, and $\omega_s = \omega_{s''}$ if $\omega_{s'} = \omega_s\omega_{s''}$ where $\omega_{s''} \in W_i$.

Figure  2.  IFS-Code  calculation  by  orbit  generating  transformations 

Besides, this greatly simplifies the classification process (cf. introduction), because
larger orbits are found first and the smaller ones no longer have to be put into the
classes. Using $\omega^n = R_0(\alpha) S(c_x^n, c_y^n) R_0(-\alpha) R_0(n(-\beta))$, several special cases can be excluded
a priori and thus unnecessary classes are not formed:

•  Multiples of other orbits are detected if $S(c'_x, c'_y) = S(c_x^n, c_y^n)$, $n \in G$, resp. if $\alpha_s = \alpha_{s'}$.

•  The orbit generating transformation $\omega_s$ of the largest orbits of one class is that $\omega'_s$
with the largest $\beta$.

•  Orbits which can be affinely mapped (by inter orbit transformations) onto each other
will now be detected by those $\omega_s$ having a multiple in $\beta$ - without using an affine
invariant representation7,8.

Thus in the classification process it is no longer necessary to use classes of orbits, but
classes of the orbit generating and inter orbit transformations represented by their log
spiral equivalent (of Theorem 1).

Deriving the IFS-Codes from non boundary orbits

Find all $\omega_{kj}$ such that for at least one of the spirals $S_{k-1,i}$ (which are formed by each best
fitting σ-Hull of $A$ for the IFS-Codes $\omega_{k-1,i}$) circumscribed to $A \setminus \omega_{kj}^{-1}(\bigcup \omega_{k-1,i}(A))$ holds
$\omega_{kj}(S_{k-1,i}) = S_{ki}$ and $S_{ki}$ a circumscribed spiral to $A \setminus \bigcup \omega_{k-1,i}(A)$ => $\omega_{kj}$ the IFS-Codes for an
inner subobject (i.e. having too few boundary points in common with the object).
Repeat this process until $A \setminus \bigcup \omega_{ki}(A) = \emptyset$, where $\{\omega_{ki}\} = \{\omega_{k-1,i}\} \cup \{\omega_{kj}\}$ and $k = \{2, \ldots, n\}$.
Note: Thus the IFS-Codes of attractors with overlapping subobjects can also be calculated.


3  Conclusion 

3 . 1  Summary 

This work shows how the boundary of a discrete self-affine object can be calculated.
This is the prerequisite for forming orbits of boundary points of an object. Subsequently the
orbits have to be classified to calculate the IFS-Codes (affine maps) of a minimal IFS as
generating transformations of the largest of these orbits in each class.

The boundary orbits are found by using a generalized convex hull - the σ-Hull. This
hull is formed by the intersection of generalized spiral disk segments (parts of the plane
bounded by 3 log spirals equal in curvature proportions and fixed points) instead of the
half planes used for the convex hull. To reduce the calculation costs, an equivalent
representation of spirals (in the polar plane) is defined by powers of affine
transformations (in the x/y-plane).

This representation has the additional advantage that - by means of the affine
transformations used - orbits can easily be calculated for the subsequent classification.
Besides, the affine equivalent to log spirals greatly reduces the expense of comparing
and differentiating the orbits within the classification process.

Furthermore, generating transformations for non-boundary orbits can be detected after
removing the subobjects lying outside which have already been calculated. Thus the
IFS-Codes for subobjects having no boundary in common with the object can be computed,
even if the subobjects are overlapping.

3.2  Future  Work 
Implementation 

This is a theoretical work, which forms the basis for the IFS-Code calculation and
classification resp. overcomes the difficulties of their calculation existing so far.
Therefore an implementation will be made together with the elaboration of the
classification part, which has to be completed to solve the inverse IFS-Problem as a whole.

But the prototype developed for the last work9 already shows that in principle this
problem is solvable by using a less general hull similar to the σ-Hull and by mapping
orbits of boundary points onto each other.

Besides, the experience gained from the implementation of this prototype showed the
need for a more general tool (the σ-Hull) to calculate and classify the orbits as the basis
for the computation of a minimal IFS.

Analysing IFS-attractors under the aspects of the σ-Hull

So far the affine equivalent of log spirals (the basis of the σ-Hull) has primarily been used
to calculate the boundary of an IFS-object. But it can also be an instrument to categorize
IFS-objects according to their intrinsic properties, such as structure, location and form of
the object parts (subobjects) resp. orbits.

Therefore the correlation between orbits and the parameters of the σ-Hull, such as the
scaling factors (and powers of them) - determining the curvature of the orbits - and the
two angles - one determining the distance of orbit points and the other determining the
contraction of an orbit - has to be studied in further detail.

The knowledge gained will also improve the IFS-calculation and contribute to a better
adaptation of the σ-Hull.

Can the classification process be further improved by the σ-Hull?

Some implications of the parameters of the σ-Hull for the classification have
already been analysed in this paper, but there should be more implicit properties of the
σ-Hull which can be used to classify the orbits.

Besides, the choice of the order in which the σ-Hulls are formed - not only in dependency
of a decreasing function of the curvature proportion - can influence the
discrimination process of the detected orbits to be classified.

Last but not least, the affine equivalent to log spirals can be used - within the
classification - to decide if one matrix can be divided by another resp. for the comparison
of matrices with respect to their fixed points.

Designing a more adequate σ-Hull

Based on the present knowledge about orbits of boundary points of IFS-objects, the basic
element of the σ-Hull was designed as the intersection of 3 spiral disks equal in
curvature proportion and fixed point. But in practice it may turn out that more or fewer log
spirals with different curvature and fixed point will do a better job.

These variations will also change the size of the basic element, which will influence the
number and accuracy of orbits found on the boundary; that is, roughly speaking, how
"tight" σ-Hulls can be formed.

So far only two types of basic elements have been investigated. The first is an intersection
of two log spiral disks having the same curvature and the same fixed point. But - though
simpler to handle - this leads to basic elements infinite in size, which are not sufficient to
circumscribe an IFS-object in several cases.

Another basic element was formed by two log spiral disks identically curved in
opposite directions and having the same tangent in their two tangent points. This leads to a
more consistent (with respect to the definition of hulls) mathematical model of the
σ-Hull, but this basic element is convex, and using its complement again leads to less
"tight" σ-Hulls, which prevent the fixed points of orbits from lying on the boundary.

Generalization  of  the  O-Hull 

In general each discrete object can be represented by an IFS as long as sufficiently many
IFS-Codes are used (in the extreme case one IFS-Code per pixel!). Decomposing an
object into adequate parts will reduce the number of IFS-Codes10. So the question is not to
design more and more complex hulls (bounded by highly non-linear curves11), but to use
the σ-Hull to find a minimal number of orbits fitting best into the calculated boundary
points. Two possible solutions are outlined below:

One is to enlarge the pixel size by an appropriate scale and to fit the orbits into fewer
boundary points. Another is to make the pixels smaller - so that each point becomes a
boundary point - and to find all different orbits within this unconnected point set. The
second approach seems to be more complicated (complexity in the point relation), but it
solves the encoding problem in one step, because there are no longer any subobjects
lying inside.

Multifractal spectra are derived for 1- and 2-dimensional cross-sections of passively
convected scalar fields; 2- and 3-dimensional single-scale velocity fields in the
absence of KAM surfaces are considered. Both the Kraichnan model and real flows
with non-zero correlation time are studied. The calculation of $f(\alpha)$-curves is based
on the probability density function of the stretching factors of fluid elements. It
is shown that strict multifractality holds only for small values of $\alpha$. A new
multifractal scalar field - "harmfulness" - is suggested to describe the propagation of
environmentally dangerous substances.

Keywords:  Turbulent  diffusion,  passive  scalar,  intermittence,  multifractality. 


1  Introduction 


The convection of a passive scalar by chaotic fluid flow has been studied intensively
during the last decades. Special attention has been paid to the analysis of the
intermittent structure of scalar fields 1-8. The scalar density correlation properties
have been found to be very far from Gaussian; clear evidence for this is the
multifractal structure of scalar dissipation fields 1-4.

Most of the theoretical studies have been based on the Kraichnan model 5, in
which the velocity field is assumed to be delta correlated in time. Despite
being very idealized, this model is far from trivial and is widely believed to
reproduce the most important features of real passive scalar turbulence. However,
this rigorous approach has still been unable to relate the multifractal spectrum
$f(\alpha)$ directly to the correlation functions of the velocity field. The main tool for
the multifractal analysis has been the generalized Baker map model [which has
also been used to calculate the probability density function (PDF) of the largest Lyapunov
exponents] 2,3. Recently, a new approach has been suggested for the multifractal
analysis of the passive scalar dissipation field, which is based on a simple equation
describing the stretching of fluid elements 9. In Sections 2 and 3 we extend the
approach to three-dimensional geometry. In Section 4, we define new scalar fields,
which are based on the density of the passive scalar, and show that they also have a
multifractal structure. When the convecting velocity field is oceanic or atmospheric
motion, and the passive scalar is an environmentally harmful substance, these fields
can be treated as measures describing the potential damage to the environment.

The  parameters  of  our  model  equations  are  defined  by  the  statistical  properties 
of  the  velocity  field.  In  the  case  of  delta-correlation  in  time  (Kraichnan  model), 
they  are  directly  expressed  via  the  correlation  functions.  The  model  is  in  good 
agreement  with  the  earlier  experimental  and  theoretical  results.


2  Basic  equation

The convective diffusion of a passive scalar $\phi$ is described by the following equation:

$$\frac{\partial\phi}{\partial t} + \mathbf{v}\cdot\nabla\phi = \kappa\nabla^2\phi + q. \qquad (1)$$

Here $q$ is a source of the passive scalar and $\kappa$ is the molecular diffusivity. We consider
a chaotic isotropic single-scale incompressible velocity field $\mathbf{v}(\mathbf{r},t)$ in the absence
of KAM surfaces. A single-scale field is defined as a field whose Fourier
spectrum is confined to one octave of wavelengths. The characteristic space
scale is chosen to be the unit length. We consider the large Peclet number limit,
$\kappa \ll \langle|\mathbf{v}|\rangle$, and a moderate time scale. More precisely, we assume that the convection
has created small-scale structures, but the smallest created scales are still longer
than the dissipation scale, i.e. $t < -\ln\kappa/2\gamma$, where $\gamma$ is the average value of
the largest Lyapunov exponent of the velocity field. Then the stretching of fluid
elements and the evolution of passive scalar gradients are related to each other
4,6,10-13. We study the case when there is no source of dye, and at the initial
moment $t = 0$ there was a uniform gradient of dye concentration,

$$q = 0, \quad \nabla\phi|_{t=0} = \mathbf{e}_x, \qquad (2)$$

where $\mathbf{e}_x$ is the unit vector along the x-axis.

With the given assumptions and for a 2-dimensional velocity field, the problem
of finding the PDF of passive scalar gradients is directly equivalent to the problem
of finding the PDF of stretching factors of fluid elements $\rho(\mathbf{r},t) = \exp(h_+t)\cos\varphi$,
where $h_+$ is the largest Lyapunov exponent and $\varphi$ is the angle between the
respective eigenvector and a fluid element 9. Indeed, neglecting the seed diffusivity,
at a fixed fluid particle, the modulus of the dye gradient $|\nabla\phi|$ scales like the
length of an infinitesimal fluid element $\delta\mathbf{r}(t)$, initially perpendicular to the gradient
[$\delta\mathbf{r}(0) \perp \nabla\phi(0)$]: $|\nabla\phi(t)| \propto |\delta\mathbf{r}(t)|$. This is due to the incompressibility of the fluid:
the fluid parallelogram defined by the initially perpendicular vectors $\delta\mathbf{r}(t)$ and $\delta\mathbf{r}_\perp(t)$
preserves its area $\delta S = |\delta\mathbf{r}(t)|\cdot|\delta\mathbf{r}_\perp(t)|\sin\alpha$, where $\alpha$ is the angle between the
vectors. On the other hand, at the fixed fluid particle, the dye concentration remains
unchanged and $|\delta\mathbf{r}_\perp(t)|\sin\alpha \propto |\nabla\phi|^{-1}$. Thus, with the proper choice of units and
in the absence of seed diffusivity, the stretching factors $\rho = |\delta\mathbf{r}(\mathbf{r}_0,t)|/|\delta\mathbf{r}(\mathbf{r}_0,0)|$
and dye gradients are equivalent to each other. The strict equivalence holds for the
fluid elements initially parallel to the isolines of $\phi(\mathbf{r},0)$; however, statistically the
initial orientation of the fluid elements becomes irrelevant.
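
The equivalence between dye gradients and stretching factors can be checked numerically in two dimensions. The sketch below is only an illustration under stated assumptions: the flow is replaced by a product of random area-preserving linear maps (the helper `random_step` and its strain range are hypothetical choices, not part of the model), diffusivity is neglected, and the initial gradient and fluid element are taken as in Eq. (2).

```python
import math
import random


def rotation(angle):
    c, s = math.cos(angle), math.sin(angle)
    return ((c, -s), (s, c))


def matmul(a, b):
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def apply(m, v):
    return (m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1])


def random_step(rng):
    """Return (M, M^{-T}) for a random map M = R(a) diag(s, 1/s) R(b) with det M = 1."""
    s = math.exp(rng.uniform(-0.5, 0.5))  # hypothetical strain range
    ra = rotation(rng.uniform(0.0, 2.0 * math.pi))
    rb = rotation(rng.uniform(0.0, 2.0 * math.pi))
    strain = ((s, 0.0), (0.0, 1.0 / s))
    inverse_strain = ((1.0 / s, 0.0), (0.0, s))
    # Rotations equal their own inverse transpose, so only the strain is inverted.
    return matmul(ra, matmul(strain, rb)), matmul(ra, matmul(inverse_strain, rb))


def evolve(n_steps, seed=0):
    """Evolve a dye gradient and a perpendicular fluid element; return both lengths."""
    rng = random.Random(seed)
    gradient = (1.0, 0.0)  # initial dye gradient along x, as in Eq. (2)
    element = (0.0, 1.0)   # fluid element initially perpendicular to the gradient
    for _ in range(n_steps):
        m, m_inv_t = random_step(rng)
        element = apply(m, element)          # material element: dr -> M dr
        gradient = apply(m_inv_t, gradient)  # gradient of a frozen-in scalar: g -> M^{-T} g
    return math.hypot(*gradient), math.hypot(*element)


grad_norm, elem_norm = evolve(n_steps=10)
print(grad_norm, elem_norm)
```

The two printed numbers agree up to rounding, which is the 2-dimensional statement that the dye gradient grows exactly like the length of a fluid element initially parallel to the isolines.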

Three-dimensional geometry can be treated in a similar way; then the PDF of
dye gradients is equivalent to the distribution of stretching factors of fluid surfaces.
The argument is completely analogous and is based on the evolution of fluid
parallelepipeds.

The multifractal analysis is based on a simple diffusion-convection equation
describing the distribution of fluid elements $l(\rho)$ and fluid surfaces $s(\rho)$ over the
stretching factor $\rho$. We derive it both for the Kraichnan model and for velocity
fields of finite correlation time. First we consider the case of real velocity fields of
finite correlation time $\tau$. The correlation time is used as the unit of time, i.e. $\tau = 1$.

Let us define $l(\rho,t)\,d\rho$ as the average total length of those pieces of a fluid line
for which the stretching factor lies in $[\rho, \rho + d\rho]$ (the length is normalized to the
initial length of the fluid line $L_0$). Then the PDF of stretching factors and dye gradients
is given by $\rho^{-1}l(\rho)$. We consider time increments $\Delta t = 1$ and study the change of the
state of fluid elements (fluid surfaces); we neglect the correlation in time for time scales
longer than $\tau = 1$. Let $p(q)\,dq$ denote the probability of stretching a fluid element by a
factor of $q$. Then we can write $l(\rho,t+1)\,d\rho = \int p(q)\,l(\rho/q,t)\,d(\rho/q)\,q\,dq$, or, introducing
$\sigma = \ln\rho$ and $\lambda(\sigma,t) = l(\exp\sigma,t)$,

$$\lambda(\sigma,t+1) = \int \lambda(\sigma - \ln q, t)\,p(q)\,dq. \qquad (3)$$

In the case of 3D velocity fields, the same equation describes the evolution of fluid
surfaces, with $\lambda(\sigma,t) = s(\exp\sigma,t)$. However, for a fixed velocity field, the stretching
of fluid elements and that of fluid surfaces are characterized by different functions $p(q)$.

The initial condition (2) implies that the dye gradients are initially
delta-distributed. Therefore, in order to keep the equivalence between the stretching
coefficients and dye gradients, the initial condition for Eq. (3) should be written as

$$\lambda(\sigma,0) = \delta(\sigma). \qquad (4)$$

This system can be solved by applying the Fourier transform
$\lambda(f,t) = \int \lambda(\sigma,t)\exp(-if\sigma)\,d\sigma$. As a result we obtain

$$\lambda(f,t) = \Pi(f)^t, \quad \Pi(f) = \int p(\exp q)\exp(q - ifq)\,dq. \qquad (5)$$
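
The step from Eq. (3) with the initial condition (4) to Eq. (5) can be illustrated with a toy stretching law. The sketch below assumes a hypothetical two-valued distribution (the names `Q_UP`, `Q_DOWN`, `W_UP` and their values are illustrative only), iterates Eq. (3) directly, and compares the Fourier transform of the result with $\Pi(f)^t$.

```python
import cmath
import math

# Hypothetical two-valued stretching law: q = Q_UP with probability W_UP, else Q_DOWN.
Q_UP, Q_DOWN, W_UP = 3.0, 0.5, 0.6


def step(weights):
    """One application of Eq. (3).

    weights[j] is the weight at sigma = j*ln(Q_UP) + (t - j)*ln(Q_DOWN),
    where t = len(weights) - 1 is the number of steps taken so far.
    """
    new = [0.0] * (len(weights) + 1)
    for j, w in enumerate(weights):
        new[j + 1] += w * W_UP       # stretched by Q_UP: sigma shifts by ln(Q_UP)
        new[j] += w * (1.0 - W_UP)   # stretched by Q_DOWN: sigma shifts by ln(Q_DOWN)
    return new


def transform_of_solution(f, t):
    """Fourier transform of lambda(sigma, t), found by iterating Eq. (3) from Eq. (4)."""
    weights = [1.0]                  # delta function at sigma = 0
    for _ in range(t):
        weights = step(weights)
    total = 0j
    for j, w in enumerate(weights):
        sigma = j * math.log(Q_UP) + (t - j) * math.log(Q_DOWN)
        total += w * cmath.exp(-1j * f * sigma)
    return total


def pi_of_f(f):
    """Pi(f) of Eq. (5); for a two-valued law the integral reduces to two terms."""
    return (W_UP * cmath.exp(-1j * f * math.log(Q_UP))
            + (1.0 - W_UP) * cmath.exp(-1j * f * math.log(Q_DOWN)))


f, t = 0.7, 12
print(transform_of_solution(f, t), pi_of_f(f) ** t)
```

Both printed complex numbers coincide up to rounding, because each time step convolves $\lambda$ with the distribution of $\ln q$ and therefore multiplies its Fourier transform by $\Pi(f)$.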

In the long-time limit, the inverse Fourier transform can be taken via the
saddle-point method:

$$\lambda \approx (t\,\partial^2F/\partial f^2)^{-1/2}\exp\{F[f_0(h),h]\,t\}.$$

Here we have denoted $h = \sigma/t$, $F(f,h) = \ln[\int p(\exp q)\exp(q + fq)\,dq] - fh$, and
$f_0(h)$ stands for the solution of the equation $\partial F/\partial f = 0$. This result has the same
form as that obtained for the PDF of the largest Lyapunov exponents $p(h_m,t)$ via the
generalized Baker map model 3. It should be stressed that although the functions
$\lambda(h)$ and $p(h_m,t)$ have a similar meaning, they are in fact distinct, even asymptotically
at $t \to \infty$. This is caused by the contribution of fluid elements that are almost
perpendicular to the eigenvector of the largest Lyapunov exponent. Note that the
function $F[f_0(h),h]$ is defined by the correlation properties of the velocity field.
For a typical localized correlation function $p(k)$, the function $F(f,h)$ grows linearly at
$f \to \pm\infty$. Consequently, $f_0(h)$ is defined only for a finite range of values of
$h$. Outside that region, $\lambda = 0$. This is quite a natural result: for real
non-Kraichnan flows with a finite amplitude of velocity fluctuations, the stretching rate
of a fluid element cannot be arbitrarily fast.

For Kraichnan flows and for real flows in the long-time limit, Eq. (5) can be
further simplified. Indeed, in the long-time limit, the function $\lambda(\sigma)$ has a smooth
profile made up of long-wavelength Fourier components. Thus, the function $\Pi(f)$
can be expanded into a power series in which only the first two terms are kept,
$\Pi(f) \approx -iuf - Df^2$. Here $u$ and $D$ are constants depending on the statistical properties
of the velocity field:

$$\ldots\,\ln k\,dk, \qquad (6)$$

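Further, the discrete increments become relatively small and can be replaced by
a time derivative. Then Eq. (3) can be rewritten as

$$\frac{\partial\lambda}{\partial t} + u\frac{\partial\lambda}{\partial\sigma} = D\frac{\partial^2\lambda}{\partial\sigma^2}. \qquad (7)$$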

For the Kraichnan model, this is the exact result, which can be obtained via
the Fourier transform of the expression $\lambda(\sigma,t) = \langle\delta[\sigma - \ln|\delta\mathbf{r}(t)|]\rangle$. Here
$\delta\mathbf{r}(t) = \delta\mathbf{r}(0)\exp(\int_0^t \nabla\mathbf{v}\,dt')$, and the limit $t \to 0$ is to be studied. Then
$\Pi(f) = -iuf - Df^2$, with

$$u\,\delta(t-t') = \ldots\,\langle[v_{r\tau}(t) + v_{\tau r}(t)][v_{r\tau}(t') + v_{\tau r}(t')]\rangle,$$

$$D\,\delta(t-t') = \tfrac{1}{2}\langle v_{rr}(t)\,v_{rr}(t')\rangle, \qquad (8)$$

where the indices $r$ and $\tau$ denote the components of the tensor $\nabla\mathbf{v}$, and $d = 2, 3$ is the
dimensionality of the space. For 3-dimensional stretching of fluid surfaces,
$\lambda(\sigma,t) = \langle\delta[\sigma - \ln|\delta S(t)|]\rangle$, where
$\delta S(t)^2 = \delta r_1(t)^2\,\delta r_2(t)^2 - [\delta\mathbf{r}_1(t)\cdot\delta\mathbf{r}_2(t)]^2$ and
$\delta\mathbf{r}_{1,2}(t) = \delta\mathbf{r}_{1,2}(0)\exp(\int_0^t \nabla\mathbf{v}\,dt')$. Unlike in the case of finite correlation time flows,
the values of $u$ and $D$ for 3-dimensional stretching of fluid surfaces are equal to the respective
values for fluid elements. This is caused by two circumstances: first, the relevant
infinitesimal increments of fluid elements satisfy the incompressibility condition;
second, only the symmetric components of the stress tensor are involved. Typically,
the stretching of fluid elements is defined by the largest Lyapunov exponent $\Lambda_+$
(except for a small fraction of fluid elements that are almost perpendicular to the eigenvector).
The stretching of fluid surfaces is governed by the sum $\Lambda_+ + \Lambda_0$, where $\Lambda_0$ is the
intermediate-valued Lyapunov exponent. Thus, the statistical equality of the two
stretching coefficients means that the average value of $\Lambda_0$ is zero. Finally we note
that in the case of the Kraichnan model, time is measured in arbitrary units,
because the correlation time $\tau = 0$.


3  Multifractality  of  the  passive  scalar  dissipation  field 

The passive scalar dissipation field created by a turbulent jet has been found to
exhibit a multifractal structure 1. This experimental finding has been addressed
in several theoretical studies 2-4. The analytic results confirm the presence of a
multifractal structure. The stretching-coefficient-based approach has been used to
calculate the multifractal spectrum $f(\alpha)$ for a two-dimensional velocity field 9. Here
we extend this approach to 3-dimensional geometry.

Throughout this section, $\lambda(\sigma,t)$ will be treated as the PDF of dye gradients,
i.e. $\sigma = \ln|\nabla\phi|$. We consider the initial condition (2), which is equivalent to (4).
Bearing this in mind, Eq. (7) can be immediately solved,

$$\lambda = (\pi Dt)^{-1/2}\exp\left[-\frac{(\sigma - ut)^2}{Dt}\right]. \qquad (9)$$

This expression allows us to derive the multifractal spectrum. It is convenient to
make use of the pattern formed by fluid curves (or fluid surfaces, for 3-dimensional
geometry), which were originally straight lines (plane surfaces), separated by unit
length and perpendicular to the gradient of the dye concentration. Note that the
idea of studying the evolution of a fluid line has been used to calculate the Kolmogorov
entropy in 2-dimensional quasi-stationary flow 14.

Experimentally, the multifractality has been observed on 1- and 2-dimensional
cross-sections of the 3-dimensional dissipation field 1. Here we shall study 1- and
2-dimensional cross-sections of 2- and 3-dimensional dissipation fields. We start
with 1-dimensional cross-sections; in particular, we consider the dependence of the
local value of $\sigma$ on the coordinate $\xi$ along the cross-section. First we note that
the characteristic fluctuation amplitude of the dye concentration is 1. Indeed,
when the fluid lines (or fluid surfaces) evolve, they will be folded; typically,
density variations of the order of unity are enclosed between two pieces of the
curve approaching each other. Thus, on the cross-section, the characteristic scale
of dye density variations is $\delta \sim 1/|\nabla\phi| \sim \exp(-\sigma)$. The small-scale variations
of the function $\sigma(\xi)$ are described by the same scale. However, the function $\sigma(\xi)$
exhibits long-range correlations as well, because two pieces of a fluid curve that
are close to each other are stretched in a similar way. It can be argued that in the rescaled
coordinate $\zeta = \int^{\xi}\exp[\sigma(\xi')]\,d\xi'$, the function $\sigma(\zeta)$ is a random Brownian function.
Indeed, the distance $\Delta\zeta$ between two fluid particles gives us an estimate of how
long (i.e. for how many correlation times) these points have evolved in an
uncorrelated manner. During each correlation
time, a fluid element is randomly stressed or stretched, leading to the change
$\Delta\sigma \sim \pm 1$. Therefore, $\Delta\zeta$ gives us an estimate of how many times the fluid elements
have been independently stressed or stretched.

Further, the multifractal structure of 1-dimensional cross-sections of 2- and
3-dimensional dissipation fields can be easily analyzed. The overall scalar dissipation
in a region $[\xi, \xi + r]$, $r < 1$, can be estimated as
$W_r(\xi) = \int_\xi^{\xi+r}\kappa|\nabla\phi|^2\,d\xi \sim \kappa k_r^2\delta_0 = \kappa k_r$,
where $k_r$ is the maximal value of $|\nabla\phi|$ over the given region, and $\delta_0 = 1/k_r$.
In order to determine the multifractal spectrum $f(\alpha)$, we need to calculate the
probability

$$p(r,\alpha) \propto r^{1-f(\alpha)} \qquad (10)$$

that the normalized dissipation $w_r(\xi) = W_r(\xi)/W_1(\xi)$ scales as the $\alpha$-th power of $r$, i.e.
$w_r \in [r^\alpha, 2r^\alpha]$. Substituting the estimate $W_r(\xi) \sim \kappa k_r$, we obtain
$W_r(\xi)/W_1(\xi) \sim k_r/k_1$. Then the probability (10) can be calculated as

$$p(r,\alpha) \approx \begin{cases} L(\rho_0)\,r, & L(\rho_0)\,r \ll 1 \\ \exp[-L(\rho_0)\,r], & L(\rho_0)\,r \gg 1 \end{cases}, \quad \rho_0 = k_1r^\alpha, \qquad (11)$$

where $L(\rho_0) = \int_{\rho_0}^{\infty} l(\rho)\,d\rho$ is the overall length [for a 3-dimensional velocity field, this
is the overall surface $L(\rho_0) = \int_{\rho_0}^{\infty} s(\rho)\,d\rho$] per unit area of those parts of the fluid
curves (fluid surfaces) which are stretched by more than a prefixed factor $\rho_0$. Indeed,
$L(\rho_0)$ is an estimate of the number of times a cross-section of unit length
is intersected by the fluid curves (fluid surfaces in 3D geometry) of stretching factor
$\rho > \rho_0$. According to (9),

$$L = \frac{1}{2}\exp\left(ut + \frac{Dt}{4}\right)\left[1 - \mathrm{erf}\left(\frac{\sigma_0 - ut - Dt/2}{\sqrt{Dt}}\right)\right], \qquad (12)$$

where $\sigma_0 = \ln\rho_0$. At large values of $\sigma_0$, the asymptotics of Eq. (12) is given by
$L \approx (\pi Dt)^{-1/2}\exp[-(\sigma_0 - ut)^2/Dt + \sigma_0]\,[2(\sigma_0 - ut)/Dt - 1]^{-1}$; substituting
$\sigma_0 = -\alpha|\ln r| + \ln k_1$ we obtain

$$p(r,\alpha) \approx r^{1-\alpha(\sqrt{1+4u/D} - \alpha|\ln r|/Dt)}. \qquad (13)$$

Here we have also substituted the value of $k_1$, which has been calculated by noting
that $p(1,\alpha) \approx 1$.
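
The error-function form of $L$ can be cross-checked numerically. The sketch below assumes the Gaussian profile of Eq. (9), $\lambda = (\pi Dt)^{-1/2}\exp[-(\sigma - ut)^2/Dt]$, and the definition $L(\rho_0) = \int_{\rho_0}^{\infty} l(\rho)\,d\rho = \int_{\sigma_0}^{\infty}\lambda(\sigma)e^{\sigma}d\sigma$; the function names, the integration cut-off and the step count are illustrative choices. The parameter values are those quoted for Figure 1.

```python
import math


def lam(sigma, u, d, t):
    """Gaussian profile of Eq. (9)."""
    return math.exp(-(sigma - u * t) ** 2 / (d * t)) / math.sqrt(math.pi * d * t)


def length_closed_form(sigma0, u, d, t):
    """Integral of lam(sigma) * exp(sigma) from sigma0 to infinity, via erfc = 1 - erf."""
    x = (sigma0 - u * t - d * t / 2.0) / math.sqrt(d * t)
    return 0.5 * math.exp(u * t + d * t / 4.0) * math.erfc(x)


def length_quadrature(sigma0, u, d, t, upper=80.0, n=100_000):
    """Same integral by the midpoint rule; the integrand is negligible beyond `upper`."""
    h = (upper - sigma0) / n
    total = 0.0
    for i in range(n):
        sigma = sigma0 + (i + 0.5) * h
        total += lam(sigma, u, d, t) * math.exp(sigma)
    return h * total


u, d, t = 0.2, 0.35, 30.0
for sigma0 in (5.0, 15.0, 20.0):
    print(sigma0, length_closed_form(sigma0, u, d, t), length_quadrature(sigma0, u, d, t))
```

The closed form follows from completing the square in the exponent, $-(\sigma - ut)^2/Dt + \sigma = -(\sigma - ut - Dt/2)^2/Dt + ut + Dt/4$, which is also where the shift $Dt/2$ inside the error function comes from.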

In its strict sense, multifractality assumes that $p(r,\alpha)$ is a power law of
$r$. According to Eq. (13), this is valid only for small values of $\alpha$,
$\alpha \ll Dt\sqrt{1+4u/D}/|\ln(r_0)|$, where $r_0$ is the smallest considered space scale. In that
case, expressions (10) and (13) yield

$$f(\alpha) = \alpha\sqrt{1+4u/D}. \qquad (14)$$

It should be stressed that this expression assumes $Lr \ll 1$, and hence $f(\alpha) < 1$.
On the other hand, slight deviations from multifractality in its strict sense may
remain unnoticed in numerical schemes for obtaining multifractal
spectra. Bearing this in mind, it makes sense to calculate the “effective” multifractal
spectrum $\langle f(\alpha)\rangle$, which might be obtained in experiments:

$$\langle f(\alpha)\rangle = \ln[p(r_0,\alpha)/r_0]/|\ln r_0|. \qquad (15)$$
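
As a rough illustration of how the effective spectrum departs from the strictly linear law, the sketch below evaluates Eq. (15) with $p(r_0,\alpha)$ taken from Eq. (13). It therefore covers only the regime $Lr \ll 1$ and does not reproduce the fall-off at large $\alpha$; the function names are illustrative, and the parameter values are those quoted for Figure 1.

```python
import math


def p_small_lr(r, alpha, u, d, t):
    """Eq. (13): p(r, alpha) in the regime L r << 1."""
    slope = math.sqrt(1.0 + 4.0 * u / d) - alpha * abs(math.log(r)) / (d * t)
    return r ** (1.0 - alpha * slope)


def effective_spectrum(alpha, r0, u, d, t):
    """Eq. (15) evaluated with p from Eq. (13)."""
    return math.log(p_small_lr(r0, alpha, u, d, t) / r0) / abs(math.log(r0))


def strict_spectrum(alpha, u, d):
    """Eq. (14): the strictly multifractal, linear part of the spectrum."""
    return alpha * math.sqrt(1.0 + 4.0 * u / d)


u, d, r0, t = 0.2, 0.35, 0.01, 30.0
for alpha in (0.05, 0.1, 0.2, 0.3, 0.4, 0.5):
    print(alpha, strict_spectrum(alpha, u, d), effective_spectrum(alpha, r0, u, d, t))
```

For small $\alpha$ the two columns nearly coincide; the gap grows as $\alpha^2|\ln r_0|/Dt$, which is the correction that makes the scaling only approximately a power law.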


Figure 1. “Effective” multifractal spectrum $\langle f(\alpha)\rangle$, defined by expressions (11), (12) and (15);
$u \approx 0.2$, $D \approx 0.35$, $r_0 \approx 0.01$, $t \approx 30$.

Further, we consider 2-dimensional cross-sections of the 3-dimensional dissipation
field. The cross-sections of the fluid surfaces are curves; in what follows, they will be
referred to as “stripes”. The variations of the dye gradient are highly anisotropic:
the characteristic scale across the “stripes” is, as before, given by $\delta \approx \exp(-\sigma)$
and is typically defined by the smallest Lyapunov exponent $h_-$. Meanwhile, the
characteristic scale along the “stripes” is much longer and is either defined by the
intermediate-valued Lyapunov exponent $h_0$ (if $h_0 < 0$), or is of the order of unity
(if $h_0 > 0$). The easiest way to obtain the $f(\alpha)$-curve is to consider 1-dimensional
cross-sections of the 2-dimensional field. The fractal dimension of the intersection
of two fractal objects is given by $D_\cap = D_1 + D_2 - D$, where $D_1$ and $D_2$ are the
dimensionalities of the objects, and $D$ is the topological dimension of the embedding
space 15. Before applying this relationship, it should be pointed out that on 1-
and 2-dimensional cross-sections, the same physical points correspond to different
values of $\alpha$. Indeed, on 1-dimensional cross-sections, the normalized dissipation
was estimated as $k_r/k_1$. For 2-dimensional geometry, the striped patterns lead to
the estimate $w_r \approx rk_r/k_1 \approx r^\alpha$. By comparing the 1- and 2-dimensional expressions
for $k_r/k_1$ (expressed via $r$ and $\alpha$), one can see that the 1-dimensional $\alpha$ corresponds
to the 2-dimensional $\alpha - 1$ [this correspondence is based on an implicit assumption
that the intermediate-valued Lyapunov exponent is not very small, $h_0t > -\ln(r_0)$].
Therefore, the $f(\alpha)$ curves are related by the equality $f_2(\alpha) = 1 + f_1(\alpha - 1)$. Then
expression (14) leads to $f_2(\alpha) = 1 + (\alpha - 1)\sqrt{1+4u/D}$.

Finally we note that the curve defined by expressions (11), (12) and (15) is
quite similar to the experimental curves which have been obtained for the passive
scalar dissipation in a turbulent jet 1. The leftmost part of the curve is linear, $f \propto \alpha$;
this is the only part of the curve corresponding to strict multifractality. Further,
there is a rapid [according to Eq. (11), exponential] fall-off at large values of
$\alpha$. The exact shape is not reproduced, because expression (11) has been obtained
for two asymptotic limits; at intermediate values of $\alpha$, the assessment is very
rough. Reasonable resemblance between the experimental 1 and theoretical curves
is achieved for the following numerical values: $u \approx 0.2$, $D \approx 0.35$, $r_0 \approx 0.01$, $t \approx 30$
(see figure).

4  The  measure  of  “harmfulness”

Convective diffusion is a phenomenon which appears in a vast number of
situations. One of the most important aspects is the mixing of harmful substances
in the natural environment. These substances may have been released as a result of
an accident, or as technological wastes; the convecting medium may be oceanic or
atmospheric motion. In most cases, the overall amount of the substance is relatively
small, so that after complete mixing there is no danger to the environment.
Meanwhile, at the moderate (under-mixing) timescale, there are clumps of the
substance; biological objects may become damaged when hit by these clumps. It is
not obvious how to measure the harmfulness of this scalar field. The measures
based on the moments of the scalar density are not suited. Indeed, at the
under-mixing timescale, the seed diffusivity can be neglected, and one can use the reduced
description, where the local state of the medium is given by one bit of information:
the substance is present (‘1’) or absent (‘0’). The local average of the scalar
concentration is also useless. This becomes evident when we consider the evolution
of a striped pattern. Due to the incompressibility of the velocity field, the average
concentration of the dye will be quasi-homogeneous: in those places where the
stretching factor is high, the stripe becomes very thin; at the same time, it will be
approached by another piece of stripe, so that the amount of dye per unit volume
will remain unchanged.

Intuitively it is clear that the quantity which does matter is the size of the
blob of dye. Thus, we define a new scalar field $\chi(\mathbf{r},t)$ as the radius of the sphere
which is completely immersed in the dye and whose center is at the given
point. This field, or its $n$-th power $\chi^n(\mathbf{r},t)$, can be taken as the measure
of the harmfulness of the admixture. While $\chi^0(\mathbf{r},t)$ (“zero order harmfulness”)
is essentially equivalent to the dye concentration itself and is distributed
quasi-homogeneously, all the other moments reveal a multifractal structure.
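
On a discretized 1-dimensional cross-section with the one-bit description (dye present or absent), the field $\chi$ reduces to the distance from each dye cell to the nearest dye-free cell. The sketch below is a minimal illustration under assumptions that are not in the text: the grid, the convention that everything outside the array is dye-free, and the helper names are all hypothetical.

```python
def harmfulness_1d(cells):
    """Return chi for each cell: the distance (in cells) to the nearest dye-free cell.

    `cells` holds 1 where dye is present and 0 where it is absent. Cells outside
    the array are treated as dye-free (an assumption of this sketch).
    """
    n = len(cells)
    chi = [0] * n
    run = 0
    for i in range(n):                  # distance to the nearest empty cell on the left
        run = run + 1 if cells[i] else 0
        chi[i] = run
    run = 0
    for i in range(n - 1, -1, -1):      # combine with the distance on the right
        run = run + 1 if cells[i] else 0
        chi[i] = min(chi[i], run)
    return chi


def total_harmfulness(chi, n):
    """n-th order harmfulness of the region: sum of chi**n over the cells containing dye."""
    return sum(c ** n for c in chi if c > 0)


cells = [0, 1, 1, 1, 1, 1, 0, 0, 1, 1, 0]
chi = harmfulness_1d(cells)
print(chi)                              # a wide blob scores higher at its center
print(total_harmfulness(chi, 0), total_harmfulness(chi, 2))
```

With $n = 0$ the sum simply counts the dye cells, in line with the remark that zero order harmfulness is equivalent to the concentration, whereas for $n > 0$ the wide blob dominates the total.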

In order to study the multifractality of the “harmfulness”, we follow the
approach of Section 3. Thus, we consider 1-dimensional cross-sections of the scalar
field. For the sake of simplicity, let us study only a 2-dimensional velocity field
(3-dimensional geometry can be handled in the same way as in the case of the dissipation
field). Suppose that initially the stripes of dye of width $\varepsilon$ were straight lines
separated by unit distance. Then the total $n$-th order “harmfulness” over a region of
size $r$ is estimated as $H_n = \int_\xi^{\xi+r}\chi^n\,d\xi \approx \varepsilon^nk_r^{-n-1}$, where $k_r$ is the smallest value
of $|\nabla\phi|$. Note that this estimate is valid for $n > 0$ (actually, for $n$ not too close to
1), because we have taken into account only the contribution of the largest blob of
dye in the given region. Strictly speaking, the sum should have been taken over all
the blobs. However, just as in the case of dissipation fields, for $n > 0$ the sum is
dominated by the largest term. Further, the multifractal spectrum of dissipation
fields was derived using the overall length per unit area $L$ of those parts of the fluid
curve which were stretched by more than a prefixed factor. Here we have a symmetric
situation: the main contribution to the “harmfulness” is made by those parts of the
stripes which have small stretching factors. Therefore, the probability $p(r,\alpha)$ can
be assessed by the same formula (11) as in the case of dissipation fields; the only
modification is that $L$ should be replaced by $\tilde{L}$, the length of those parts of the
stripes which are stretched less than $k_1r^{-\alpha/(n+1)}$ times. Following the procedure
of obtaining Eq. (14), we obtain

$$f_n(\alpha) = \frac{\alpha}{n+1}\sqrt{1+4u/D}. \qquad (16)$$

Note that while the multifractality described by Eq. (14) is caused by the
“right-hand tail” of the Gaussian distribution function (9), i.e. by the fall-off at $\sigma \to +\infty$,
the multifractality of the “harmfulness” is due to the left-hand Gaussian tail.
Therefore, it is not surprising that Eqns. (14) and (16) are very similar to each
other.

5  Conclusions 

We have introduced the concept of “harmfulness”, which addresses the problem
of environmental damage caused by technological wastes. We have shown that in
the case of passive scalar turbulence, both the multifractality of the dissipation
field and “harmfulness” are caused by uneven stretching of fluid elements. For the
multifractality of the dissipation field, this is a known result 2,4. The advantage of
the approach based on Eq. (7) is its simplicity, which made it possible to calculate
the $f(\alpha)$-curves. Eq. (7) can also be used to study other aspects of passive
scalar turbulence. Thus, it has been modified to take into account the non-zero
molecular diffusivity 9. In such a way, it was possible to address the problem of the
stationary $1/k$ power spectrum, and the exponential decay of the dye fluctuations in
the case of initially seeded dye. However, it seems that the approach cannot be
extended to study the PDF of dye gradients in the presence of non-zero diffusivity:
the dye gradients are affected by the coherence between the existing patterns and
the patterns formed by diffusing dye.


We introduce a method to generate statistical textural transforms that improves the treatment of
non-stationarity and leads to a sharper detection of the boundaries between distinct textures (texture
segmentation). This method is based on sliding window processing with a fixed window size. The basic idea
proposed by the authors is to readjust the measuring window around each pixel so as to maximize
homogeneity. We use this method with the dimensions Dn(q) that are derived from the Generalized
Multifractal Analysis formalism, to show that the Dn(q)'s can detect and quantify departures from
multifractality, while providing the analogue of the classical generalized dimension if the measure is
multifractal.


1  Introduction 

We propose a method to generate statistical textural transforms that improves the
treatment of non-stationarity. Our goal is to improve the quality of the texture
segmentations 1 that can be obtained from textural transforms based on simple sliding
window processing 2. In this context, one usually assumes implicitly that the signal is
locally stationary and then proceeds directly with parameter estimation. This assumption
is usually not legitimate for every part of the signal. Indeed, there are usually some
windows that are not homogeneously textured, which can result in unreliable texture
parameters. We propose a more careful treatment of the local homogeneity that leads to
significant segmentation improvements. We also apply our method to the generalized
multifractal analysis (GMA) representation 3 to show that the generalized dimensions
Dn(q) can detect and quantify departures from multifractality, while providing the
analogue of the classical generalized dimension D(q) if the measure is multifractal.

2  Non-Stationarity  and  the  legitimacy  of  statistical  texture  parameters 

For a 1D signal S(x), a resolution-preserving 4 textural transform associates with each point
of the x-axis a number that quantifies an aspect of the local variations of S(x) in the
neighborhood of x. The texture parameter associated with a point $x_0$ is often computed from
an interval centered on $x_0$ such as $[x_0 - L_w/2,\ x_0 + L_w/2]$, where $L_w$ is the window size. In
this case, we talk about sliding window processing of the signal. In the following, the
function giving a texture parameter as a function of x will be called a texture log. For
statistical textural transforms, the parameter computed from the signal is statistical in
nature.

Texture logs can be used to detect variations in the local statistics of a signal. The
signals of interest are therefore typically non-stationary. In this paper, we focus on signals
for which the textural variations are either slow or abrupt. In other words, we consider
signals that are almost piecewise stationary, i.e. nearly stationary on consecutive disjoint
intervals. In this context, the presence of discontinuities at the boundaries of adjacent
stationary zones raises questions with respect to sliding window processing. Consider
for instance the simple signal shown in Fig. 1. This signal is composed of four segments
of equal size (500 points). For each segment, the signal was constructed by adding
uniform random deviates (uncorrelated and uniform on [0, 1]) to either a constant or a
slow linear trend. This signal will be regarded as approximately piecewise stationary.

To start with a very simple example of a statistical texture parameter, let us suppose
that we want to obtain the mean texture log of this signal with sliding window
processing, i.e. we compute for each point the arithmetic average of the window centered
on this point (Fig. 2). As long as the window lies entirely within one of the four segments,
the data contained in this window are approximately stationary (i.e. there is statistical
homogeneity within the window) and it makes sense to compute the mean from the
window. However, for windows that overlap two segments, the situation is
different. Such windows contain two subsets of data that have different and
inconsistent probability distributions. From a statistical standpoint, it is not legitimate or
meaningful to blend two statistically inconsistent samples and then compute a mean.
Indeed, the mean obtained is not representative of either of the two statistical ensembles.
This simple example suggests that a statistical texture parameter should be computed
only for windows that are sufficiently homogeneous statistically^a, otherwise the resulting
parameter is neither representative nor statistically meaningful.
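
The behaviour of the mean texture log near a boundary can be reproduced with a short sketch. Everything numerical here is an assumption made for illustration: the segment levels and the trend in `make_signal` are hypothetical (the text only specifies four 500-point segments of uniform noise added to a constant or a slow linear trend), and the window is indexed by its first sample rather than its center.

```python
import random


def make_signal(seed=0, segment_length=500):
    """Four segments of uniform noise on [0, 1) added to a constant or a slow linear trend."""
    rng = random.Random(seed)
    # (level at segment start, level at segment end); hypothetical values
    levels = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.5), (2.5, 2.5)]
    signal = []
    for start, end in levels:
        for i in range(segment_length):
            trend = start + (end - start) * i / segment_length
            signal.append(trend + rng.random())
    return signal


def mean_texture_log(signal, window):
    """Return means[k] = average of signal[k:k + window]; the window center is k + window // 2."""
    total = sum(signal[:window])
    means = [total / window]
    for k in range(1, len(signal) - window + 1):
        total += signal[k + window - 1] - signal[k - 1]   # slide the window by one sample
        means.append(total / window)
    return means


signal = make_signal()
log = mean_texture_log(signal, window=50)
print(log[225], log[475], log[725])   # inside segment 1, straddling i = 500, inside segment 2
```

A window that straddles the boundary at i = 500 returns a value between the two plateaus, which is the blended mean that the text argues is not representative of either segment.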

This leads to the problem of measuring the statistical homogeneity of a window. We
will use the simple approach that consists in splitting the window into two disjoint
sub-samples of equal size, and then applying a statistical test to compare the distributions
obtained from each half. For this comparison, we will use the Kolmogorov-Smirnov
statistical test, which gives the probability prob1 that the two samples were drawn from the
same distribution. To include a sensitivity to 2-point statistics, we will also compare the
distributions of the 2-point products s(i) s(i+n) (n = 0, 1, 2, ...) obtained from each half
window (here s(i) denotes the value of the signal at point i). For each n, the
Kolmogorov-Smirnov test gives a probability prob2(n) that the two samples were drawn from the same
distribution. We then define a homogeneity index by

Homogeneity Index = Min {prob1, prob2(1), prob2(2), ...}

i.e. our homogeneity index is the most pessimistic probability obtained from the statistical
comparisons involving one- and two-point statistics.
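
The homogeneity index can be sketched as follows. This is an illustration under assumptions, not the authors' implementation: the two-sample Kolmogorov-Smirnov probability is computed with the standard asymptotic series and a small-sample correction factor, the number of lags `max_lag` is a hypothetical parameter, and the lags start at n = 1 as in the displayed definition.

```python
import math


def ks_statistic(a, b):
    """Largest distance between the empirical distribution functions of two samples."""
    a, b = sorted(a), sorted(b)
    i = j = 0
    d = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        while i < len(a) and a[i] <= x:
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        d = max(d, abs(i / len(a) - j / len(b)))
    return d


def ks_probability(a, b):
    """Asymptotic two-sample Kolmogorov-Smirnov p-value (series approximation)."""
    d = ks_statistic(a, b)
    root = math.sqrt(len(a) * len(b) / (len(a) + len(b)))
    lam = (root + 0.12 + 0.11 / root) * d
    if lam < 0.3:            # the series converges slowly here; the p-value is close to 1
        return 1.0
    total = 2.0 * sum((-1) ** (j - 1) * math.exp(-2.0 * j * j * lam * lam)
                      for j in range(1, 101))
    return min(1.0, max(0.0, total))


def lagged_products(x, lag):
    """Two-point products s(i) * s(i + lag) within one half window."""
    return [x[i] * x[i + lag] for i in range(len(x) - lag)]


def homogeneity_index(window, max_lag=3):
    """Min of prob1 and prob2(1..max_lag), comparing the two halves of the window."""
    half = len(window) // 2
    first, second = window[:half], window[half:2 * half]
    probs = [ks_probability(first, second)]
    for lag in range(1, max_lag + 1):
        probs.append(ks_probability(lagged_products(first, lag),
                                    lagged_products(second, lag)))
    return min(probs)


homogeneous = list(range(25)) * 2                                  # both halves identical
stepped = [i / 25 for i in range(25)] + [5 + i / 25 for i in range(25)]
print(homogeneity_index(homogeneous), homogeneity_index(stepped))
```

The first window gives an index of 1 because its two halves are identical, while the window containing a jump gives an index that is practically zero, matching the sharp drops described for the discontinuities.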

In  Fig.  2  we  plotted  the  mean  texture  log  together  with  the  homogeneity  index  for  the 
sample  of  Fig.  1  (window  size  =  50  points).  The  homogeneity  index  drops  to  extremely 
small  values  at  the  discontinuities  occurring  at  i  =  500  and  1000.  Around  these  locations, 
the  texture  log  takes  the  form  of  a  linear  transition  between  two  plateaus.  Elsewhere,  the 
homogeneity  index  is  large  enough  to  confirm  the  approximate  stationarity  of  the  signal. 
In  our  opinion,  the  windows  for  which  the  homogeneity  index  is  very  low  should  not  be 
used to estimate a statistical parameter. For this reason, we propose to revise the idea of a
textural transform strictly based on sliding window processing.


a  In  this  paper,  we  will  use  the  expressions  statistical  homogeneity  and  stationarity  as 
synonymous. 


Figure 1: An example of a signal that is approximately piecewise stationary (horizontal axis: position i).

Figure 2: Mean texture log and homogeneity index (the stalactite-looking signal on top).


3  A  segmentation-oriented  strategy  for  textural  transforms 

On  one  hand,  the  idea  of  associating  a  texture  parameter  computed  from  a  window  to  the 
center  of  this  window  is  not  as  natural  as  it  might  seem  a  priori.  Indeed,  a  texture 
parameter  is  associated  with  a  window,  not  with  a  point.  In  reality,  any  point  belonging  to 
this  window  is  equally  entitled  to  receive  this  texture  parameter,  especially  if  every  point 
has  contributed  equally  to  its  computation.  On  the  other  hand,  the  whole  purpose  of  a 
texture log is to detect spatial variations of the local texture. From this standpoint, it is
desirable  that  the  spatial  variations  of  the  texture  log  reflect  the  spatial  variations  of  the 
statistics  of  the  underlying  signal.  Attributing  the  texture  parameter  to  the  window  center 
is  a  simple  way  to  obtain  this  dependence. 

In  this  paper,  we  adopt  the  standpoint  according  to  which  the  textural  transform  is  to 
be used primarily for segmentation purposes, i.e. to divide the signal into zones having
similar  statistics.  In  other  words,  the  textural  transform  is  regarded  as  a  first  step  toward 
segmentation.  It  follows  from  this  standpoint,  for  instance,  that  if  there  is  an  abrupt 
transition  between  two  zones,  then  the  texture  log  should  also  exhibit  an  abrupt  transition. 
It  also  follows  that  it  is  necessary  to  associate  a  texture  parameter  with  each  point  of  the 
signal,  i.e.  the  textural  transform  should  be  resolution  preserving,  which  is  not  possible  if 
only  the  window  centers  are  considered  (i.e.  sliding  window  processing).  Finally,  a 
statistical  textural  transform  should  be  based  only  on  windows  that  are  reasonably 
homogeneous  statistically,  as  far  as  possible. 

To satisfy these guidelines, let us examine again the abrupt transition between two
zones (Figs. 1-2). If the window considered is homogeneous enough, then there is no
statistical inconsistency and we choose to adopt the usual sliding window method, i.e. we
attribute to a point x the texture parameter of the window that is centered on x. However,
if the window overlaps two different zones A and B, then the window is
statistically inconsistent and we want to modify the usual sliding window strategy. If the
point  x  considered  belongs  to  zone  A,  then  it  seems  natural  to  attribute  to  x  the  texture 
parameter  of  a  window  that  both  contains  x  and  that  lies  entirely  within  the  zone  A  (see 
point  x3  in  Fig.  3).  Indeed,  x  belongs  to  zone  A  and  should  therefore  be  attributed  a 
texture  parameter  that  is  representative  of  this  zone.  From  this  standpoint,  the  problem  is 
to  select  for  each  point  x  the  most  appropriate  window  among  all  the  windows  that 
contain  x. 

We  propose  the  following  window  selection  rule  for  a  given  point  x.  Among  all  the 
windows  that  contain  x  and  that  have  a  satisfactory  homogeneity  (if  they  exist)  we  select 
the  window  that  is  most  centered  on  x,  i.e.  the  window  for  which  the  distance  between  its 
center  y  and  x  is  minimum.  If  none  of  the  windows  containing  x  is  homogeneous  enough, 
then  we  simply  select  the  most  homogeneous  window  containing  x.  Notice  that  in  general 
there  will  be  two  windows  at  an  equal  distance  |y  -  x|  of  x,  the  left  and  the  right  window. 
To  make  the  choice  unambiguous,  we  select  the  most  homogeneous  of  these  two 
windows.  It  is  emphasized  that  if  the  signal  is  homogeneous  enough,  then  our  definition 
reduces  to  the  usual  sliding  window  approach.  However,  if  the  homogeneity  is  not 
acceptable,  then  our  definition  forces  the  windows  to  “stay  away”  from  discontinuities. 
We  will  refer  to  this  approach  as  the  sliding  window  method  with  homogeneity 
correction,  or  SWMHC. 
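The window selection rule above can be sketched as follows. This is a minimal illustration under stated assumptions: the function `swmhc`, its arguments, and the toy homogeneity measure used in the demonstration are hypothetical stand-ins (in the paper the homogeneity index is the Kolmogorov-Smirnov based quantity defined earlier).

```python
def swmhc(signal, width, stat, homogeneity, level=0.05):
    """Sliding window method with homogeneity correction (illustrative sketch)."""
    n = len(signal)
    starts = range(n - width + 1)
    h = [homogeneity(signal[s:s + width]) for s in starts]
    t = [stat(signal[s:s + width]) for s in starts]
    out = []
    for x in range(n):
        # Every window [s, s + width) that contains the point x.
        candidates = range(max(0, x - width + 1), min(x, n - width) + 1)
        acceptable = [s for s in candidates if h[s] >= level]
        if acceptable:
            # Most centered acceptable window; ties go to the more homogeneous one.
            best = min(acceptable,
                       key=lambda s: (abs(2 * s + width - 1 - 2 * x), -h[s]))
        else:
            # No acceptable window: fall back on the most homogeneous one.
            best = max(candidates, key=lambda s: h[s])
        out.append(t[best])
    return out


def mean(window):
    return sum(window) / len(window)


def toy_homogeneity(window):
    # Hypothetical stand-in: 1 for a constant window, 0 otherwise.
    return 1.0 if max(window) == min(window) else 0.0


step = [0.0] * 20 + [10.0] * 20
corrected = swmhc(step, 6, mean, toy_homogeneity, level=0.05)
plain = swmhc(step, 6, mean, toy_homogeneity, level=0.0)  # usual sliding window
```

With `level=0.0` every window is acceptable and the rule reduces to the usual sliding window, which smears the step over several points; with the correction, the step in the texture log stays at its exact location.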

Let us examine a few consequences of our homogeneity correction. If x is close^b to
the beginning of the signal, then the windows containing x cannot be centered on x (see
point x1 in Fig. 3). If x is away from the boundaries and in a homogeneous zone, then the
window selected will be centered on x (point x2 in Fig. 3). If x is close^c to a boundary
between two zones, then the window selected will tend to be entirely within its own zone
(point x3 in Fig. 3).


Figure  3:  Attribution  of  windows  to  each  point  at  the  boundary  of  two  zones. 

We  applied  this  homogeneity  correction  algorithm  to  the  signal  of  Fig.  1,  setting  the 
acceptable  degree  of  homogeneity  to  0.05  (in  statistics,  it  is  usual  to  reject  a  hypothesis  if 
the  probability  associated  with  a  test  is  smaller  than  0.05).  It  is  seen  in  Fig.  4  that  the 
discontinuities  are  detected  with  perfect  accuracy,  even  with  a  50-point  window  size, 
whereas  the  rest  of  the  log  remains  unchanged  with  respect  to  Fig.  2.  A  minimum 
homogeneity index of 0.05 could be obtained everywhere. Our SWMHC has the merit of


b  If  the  distance  between  x  and  the  beginning  of  the  signal  is  smaller  than  Lw  /  2,  where 
Lw  is  the  window  size. 

c If the distance between x and the boundary of the signal is smaller than Lw / 2, where Lw
is the window size.

detecting boundaries sharply even for large windows, which is not possible with the usual
sliding  window  method.  Our  algorithm  reduces  uncertainties  on  texture  logs  because  the 
most  inhomogeneous  windows  are  rejected.  In  the  following,  the  SWMHC  will  be  applied 
to  the  generalized  dimensions  Dn(q). 


Figure 4: Mean texture log with homogeneity correction (horizontal axis: position i). The boundaries at positions 500 and 1000 are detected
with perfect accuracy, in spite of the fact that a 50-point window size was used.


It is beyond the scope of this paper to give a comprehensive comparison of our
method with the numerous other approaches to texture segmentation, ranging from
statistical methods to wavelet-based methods and neural networks. In the context of
wavelet-based methods, for instance, clustering procedures6,7,8 or detection of sharp
transitions over wavelet energy measurements have been used to produce a final segmentation.
These algorithms perform well experimentally but rely on ad hoc parameter
settings9. The same can be said of the segmentation methods that combine statistical
parameters and neural networks10, because the network parameters are adjusted on training
sets.

One  of  the  first  methods  used  in  texture  segmentation,  and  still  a  major  one,  is  the 
spatial  gray  level  co-occurrence  matrix11.  Many  authors  have  approached  the  problem  of 
texture  segmentation  with  split-and-merge  methods  combined  with  co-occurrence 
matrices1, 5.  In  this  context,  one  creates  homogeneous  regions  by  splitting  inhomogeneous 
regions  into  smaller  regions  until  a  given  homogeneity  criterion  is  satisfied  in  each  region 
(the  regions  can  be  rectangular  if  a  regular  grid  is  used,  or  else  they  can  have  other  shapes 
depending  on  the  space  partitioning  method  selected).  In  comparison,  our  approach 
consists  in  constructing  a  textural  transform  based  on  a  fixed  window  size  (and  shape), 
but  the  window  containing  a  given  point  can  be  moved  continuously  to  satisfy  a 
homogeneity  criterion.  In  addition,  our  choice  of  a  statistical  test  to  assess  homogeneity 
gives  a  certain  objectivity  to  the  homogeneity  criterion. 

For a fixed window size, our method cannot always reach an acceptable degree of
homogeneity everywhere. However, reducing the window size sufficiently makes it
possible to reach an acceptable homogeneity at all points (as in the split-and-merge algorithm). An
appropriate combination of texture logs obtained for different window sizes can be used
to produce a multi-resolution texture log with satisfactory homogeneity everywhere.


4  Brief review of the generalized multifractal analysis representation

4.1  Generalized dimensions defined as projections of the generating function on orthogonal polynomials

We summarize here the generalized multifractal analysis (GMA) representation3. The
generating function $\chi_q(\delta)$ used in multifractal analysis is a function of two variables $q$
and $\delta$. The parameter $q$ is real and $\delta > 0$ is a scale ratio $\delta = \ell/L < 1$, where $\ell$ is a length
scale and $L$ is an upper bound on $\ell$. Multifractal analysis can be done on the condition
that $\chi_q(\delta)$ satisfies the power law property7 $\chi_q(\delta) \sim \delta^{\tau(q)}$ in an interval of $\delta$
called scaling range. With the change of variable $x = -\ln(\delta) \ge 0 \Leftrightarrow \delta = \exp(-x)$ and the
definition $\varphi_q(x) = -\ln(\chi_q(\exp(-x)))$, this property can be rewritten in the linear form
$\varphi_q(x) = \tau_0(q) + \tau_1(q)\,x$. Many signals cannot be described satisfactorily with this linear
model. The GMA representation of $\varphi_q(x)$ solves this problem by expanding $\varphi_q(x)$ in
terms of orthogonal polynomials $P_n(x)$ of increasing order $n$

$$\varphi_q(x) = \sum_{n=0}^{N} \tau_n(q)\, P_n(x) \qquad (1)$$

The model (1) is a generalization of the multifractal model. The coefficients $\tau_n(q)$ are
generalizations of $\tau(q)$, and $\tau_1(q)$ corresponds to the usual $\tau(q)$ for multifractals. In this
paper, we compute $\chi_q(\delta)$ for all the length scales available, i.e. $\ell_n = n$, $n = 1, 2, ..., N_x$.
The coordinates $x$ take discrete values $x_i$, $i = 1, 2, ..., N_x$ and $\varphi_q(x)$ is computed for these
$x_i$'s. The $P_n(x)$'s are orthogonal with respect to a scalar product that we define for any
two functions $f$ and $g$ by

$$(f, g) = \sum_{i=1}^{N_x} w(x_i)\, f(x_i)\, g(x_i) \qquad (2)$$

where the $w(x_i) > 0$ are weights. A first possibility is to use unit weights, i.e. $w(x_i) = 1$
for each $i$, which gives a larger weight to large scales. Indeed, the density $\rho(x)$ of the
$x_i$'s along the x-axis varies approximately according to
$\rho(x_n) = 1/(x_n - x_{n+1}) = 1/\ln(\ell_{n+1}/\ell_n) \sim \ell_n$ and therefore it increases as $\ell_n$ increases.
Another possibility is to use variable weights that compensate for the non-uniform
density of the $x_i$'s by choosing $w(x_n) = k/\rho(x_n)$ where the constant $k$ is chosen so that
$w(x_1) = 1$. This choice results in approximately uniform weighting along the x-axis in
the summation (2). It is this second choice that we make in this paper, mostly because it
was found to lead to exponents $\tau_n(q)$ that are more sensitive to textural variations.

The $P_n(x)$'s are obtained with a Gram-Schmidt orthogonalization starting with
$P_0(x) = 1$ and using iteratively the formula
$P_n(x) = x^n - \{p_n(0)\,P_0(x) + p_n(1)\,P_1(x) + ... + p_n(n-1)\,P_{n-1}(x)\}$. Hence they satisfy
$(P_n, P_m) = \delta_{n,m}\,(P_n, P_n)$ and the $\tau_n(q)$'s in (1) are defined by

$$\tau_n(q) = (P_n, \varphi_q)/(P_n, P_n) \qquad (3)$$

The $\tau_n(q)$'s can always be written in the form $\tau_n(q) = (q-1)\,D_n(q)$, where the $D_n(q)$'s
are formal extensions of the generalized dimensions $D(q)$. The linear component $D_1(q)$
reduces to the generalized dimension $D(q)$ if the measure is multifractal, i.e. if $N = 1$ in
eq. (1).
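The projection (3) can be sketched numerically. The code below is an illustration under stated assumptions rather than the authors' program: the function names are hypothetical, the Gram-Schmidt coefficients are taken to be $p_n(m) = (x^n, P_m)/(P_m, P_m)$, the upper bound $L = 64$ is arbitrary, and the weights follow $w(x_n) = k/\rho(x_n)$ with $\rho(x_n) = 1/\ln(\ell_{n+1}/\ell_n)$ and $w(x_1) = 1$.

```python
import math


def inner(f, g, w):
    """Weighted scalar product of eq. (2) on the discrete grid."""
    return sum(wi * fi * gi for wi, fi, gi in zip(w, f, g))


def orthogonal_polys(x, w, order):
    """Gram-Schmidt orthogonalization of 1, x, x^2, ... (values on the grid)."""
    polys = []
    for n in range(order + 1):
        p = [xi ** n for xi in x]
        for prev in polys:
            c = inner(p, prev, w) / inner(prev, prev, w)
            p = [pi - c * qi for pi, qi in zip(p, prev)]
        polys.append(p)
    return polys


def tau_coefficients(phi, x, w, order):
    """Projections tau_n = (P_n, phi) / (P_n, P_n) of eq. (3)."""
    polys = orthogonal_polys(x, w, order)
    return [inner(p, phi, w) / inner(p, p, w) for p in polys]


n_scales, upper = 32, 64.0                       # assumed N_x and L
scales = range(1, n_scales + 1)                  # l_n = n
x = [-math.log(n / upper) for n in scales]       # x_n = -ln(l_n / L)
w = [math.log((n + 1) / n) / math.log(2.0) for n in scales]  # w(x_1) = 1

phi_linear = [0.3 + 0.8 * xi for xi in x]        # an exactly linear phi_q(x)
tau = tau_coefficients(phi_linear, x, w, order=2)
```

For an exactly linear $\varphi_q(x)$ the coefficient `tau[1]` recovers the slope and `tau[2]` vanishes to rounding error, which is the sense in which the higher-order coefficients measure departures from the multifractal model.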

4.2  Definition of the generating function

We define the measure of an interval $B_r(\delta)$ of size $\delta$ centered on a point $r$ by
$\mu_r(\delta) = \sum_{r_i \in B_r(\delta)} s(r_i)$. To estimate uncertainties on the generating function, we always
consider a collection of $M$ equal size samples ($M > 2$) drawn from the same statistical
ensemble. For a single window in a signal, we split the window into four disjoint intervals
of the same size that are regarded as four independent samples. Firstly, we define for each
of these samples an “individual” generating function by (this form holds only if $\mu_r(\delta) > 0$,
see 3)

$$\chi_q(\delta) = \delta^{(q-1)D}\, \langle [\mu_r(\delta)]^q \rangle_s \,/\, (\langle \mu_r(\delta) \rangle_s)^q \qquad (4)$$

where $D$ is the dimension of the embedding space, and angle brackets $\langle ... \rangle_s$ denote a
spatial average. The denominator of (4) guarantees that $\chi_q(\delta)$ is normalized exactly, i.e.
that $\chi_1(\delta) = 1$ for all $\delta$. Secondly, we define the “global” generating function $\chi_q(\delta)$
obtained from the $M$ samples by

$$\chi_q(\delta) = \frac{1}{M} \sum_{m=1}^{M} \chi_q^{(m)}(\delta) \qquad (5)$$

where $\chi_q^{(m)}(\delta)$ denotes the individual generating function of sample $m$. The uncertainty on
$\chi_q(\delta)$ is estimated by taking into account the variations of $\chi_q^{(m)}(\delta)$ from sample to
sample, and the corresponding uncertainties on $D_n(q)$ can be derived3 (we assume that the
$\chi_q^{(m)}(\delta)$'s are uncorrelated). The error formulas used in this paper are ($\sigma(X)$ denotes the
standard deviation of a random variable $X$)

$$\sigma^2(\varphi_q(x)) = \frac{1}{M} \left( \frac{\sigma(\chi_q^{(m)}(\delta))}{\chi_q(\delta)} \right)^2 , \quad \sigma^2(\tau_n(q)) = (P_n^2, \sigma^2(\varphi_q)) / ((P_n, P_n))^2 , \quad \sigma(D_n(q)) = \sigma(\tau_n(q)) / |q - 1|$$
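Equations (4) and (5) can be illustrated with a short sketch. Several choices below are assumptions not fixed by the text: the boxes are taken at every sliding position inside a sample, box sizes are counted in points, and the function names and the bound `upper` (playing the role of $L$) are hypothetical.

```python
import random


def individual_chi(sample, box, q, upper, dim=1):
    """Individual generating function of eq. (4) for one sample."""
    # Measure of every box of `box` consecutive points (must be positive).
    mus = [sum(sample[r:r + box]) for r in range(len(sample) - box + 1)]
    delta = box / upper
    mean_mu = sum(mus) / len(mus)
    mean_mu_q = sum(mu ** q for mu in mus) / len(mus)
    return delta ** ((q - 1) * dim) * mean_mu_q / mean_mu ** q


def global_chi(window, box, q, upper, n_samples=4):
    """Global generating function of eq. (5): mean over disjoint samples."""
    size = len(window) // n_samples
    parts = [window[m * size:(m + 1) * size] for m in range(n_samples)]
    chis = [individual_chi(part, box, q, upper) for part in parts]
    return sum(chis) / n_samples, chis


uniform = [1.0] * 64
chi_uniform, _ = global_chi(uniform, box=4, q=2, upper=16)

rng = random.Random(1)
positive = [rng.random() + 0.1 for _ in range(64)]
chi_q1, samples_q1 = global_chi(positive, box=4, q=1, upper=16)
```

For a uniform signal the ratio of averages equals one, so only the prefactor $\delta^{(q-1)D}$ remains; for $q = 1$ the normalization gives exactly one whatever the signal. The spread of the individual values across samples is what feeds the uncertainty estimate.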

5  Behavior  of  the  Dn(q)s  for  a  transition  between  two  distinct  random 
binomial  measures 

Our goal here is to show that the generalized dimension $D_1(1)$ reduces to the classical
information dimension $D(1)$ for such archetypal multifractals. The homogeneity index is
computed  as  previously  except  for  one  difference:  The  two  samples  obtained  from  each 
half  of  the  window  are  normalized  by  their  mean  before  the  statistical  tests  are  done.  This 
choice  is  partly  justified  by  the  fact  that  the  two  halves  of  a  binomial  measure  are 
statistically  identical  only  if  each  half  of  the  measure  is  individually  normalized. 

We construct the random binomial measures as usual. Initially, a unit mass is
assigned to the interval [0,1]. In the first step, the unit interval is split in two halves
[0, 1/2] and [1/2, 1]. The first interval receives a random fraction $W_{1,1}$ of the mass, where
$0 < W_{1,1} < 1$, while the second interval receives a mass $1 - W_{1,1}$. In the next steps this
splitting procedure is repeated in a self-similar manner: In the step $n + 1$, interval $I_i$ =
$[(i-1)(1/2)^n, i\,(1/2)^n]$ of size $\delta_n = (1/2)^n$ and mass $\mu_i(n)$, is split in two halves
receiving the masses $\mu_{i,1}(n+1) = W_{n,i}\,\mu_i(n)$ and $\mu_{i,2}(n+1) = (1 - W_{n,i})\,\mu_i(n)$, where $W_{n,i}$
is a random variable ($1 \le i \le 2^n$). The multipliers $W_{n,i}$ satisfy $0 < W_{n,i} < 1$ and are
identically distributed and mutually independent random variables. We choose here
multipliers that take two values $w_1$ and $w_2 = 1 - w_1$ with equal probability. The signal
examined (Fig. 5) is composed of two adjacent binomial measures with parameters $w_1$ =
0.25 and $w_1$ = 0.35, 1024 points each (n = 10, i.e. 10 iterations). Each measure was
normalized to get a unit root mean square.
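The construction above can be sketched as follows. The function names are illustrative assumptions, and the entropy formula used for the information dimension is the one quoted later in this section, $D(1) = -w_1 \log_2(w_1) - w_2 \log_2(w_2)$.

```python
import math
import random


def random_binomial_measure(w1, steps, rng):
    """Multiplicative cascade: each interval splits its mass as (W, 1 - W)."""
    masses = [1.0]
    for _ in range(steps):
        refined = []
        for mass in masses:
            # The multiplier takes the values w1 and 1 - w1 with equal probability.
            w = w1 if rng.random() < 0.5 else 1.0 - w1
            refined.extend((w * mass, (1.0 - w) * mass))
        masses = refined
    return masses


def normalize_rms(values):
    """Rescale the values to a unit root mean square."""
    rms = math.sqrt(sum(v * v for v in values) / len(values))
    return [v / rms for v in values]


def information_dimension(w1):
    w2 = 1.0 - w1
    return -(w1 * math.log2(w1) + w2 * math.log2(w2))


rng = random.Random(2000)
left = random_binomial_measure(0.25, 10, rng)
right = random_binomial_measure(0.35, 10, rng)
signal = normalize_rms(left) + normalize_rms(right)   # 2048 points in total
d1_left = information_dimension(0.25)
d1_right = information_dimension(0.35)
```

The two theoretical information dimensions evaluate to about 0.811 and 0.934, the values against which the texture logs of this section are compared.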

We first consider in Fig. 6 the $D_1(1)$ texture log obtained with a window size of
128 points, a maximum box size of 32 points and no homogeneity correction. The dashed
lines represent the theoretical values of $D(1)$ for the two binomial measures (we get
$D(1)$ = 0.811 and 0.934 with $D(1) = -w_1 \log_2(w_1) - w_2 \log_2(w_2)$). $D_1(1)$ fluctuates
around the theoretical value of $D(1)$ on both sides, which shows that the GMA
representation is consistent with the usual description of multifractals. It can be noticed
that $D_1(1)$ oscillates a little below $D(1)$, though the difference is small, which can be
explained by the fact that our generating function $\varphi_q(x)$ is not perfectly linear for the
binomial measure. The homogeneity index drops to very small values at the boundary
between the two measures, located at position 1024. Within each zone, the homogeneity
index is relatively large but sometimes drops to small values. This occasional lack of
homogeneity results from the fact that the statistical self-similarity of the binomial
measure holds exactly only if the two intervals compared with the statistical test match
exactly the construction grid of the measure, which is usually not the case.


We recomputed the $D_1(1)$ texture log of Fig. 6 with the homogeneity correction at
level 0.05 (Fig. 7). This level was achieved at about 75% of locations, but the level 0.04 is
reached almost everywhere. The transition between the two measures is sharp and its
location exact, in spite of a window size of 128 points. The amplitude of the fluctuations
is reduced by the homogeneity correction. Plateaus are formed, which is a characteristic
property of the homogeneity correction. Higher order dimensions, starting with $D_2(1)$,
were found to be too small to be distinguished unambiguously (taking into account error
bars). This is expected because the function $\varphi_q(x)$ is almost perfectly linear for such
binomial measures.

Figure 6: $D_1(1)$ texture log. The stalactite-looking curve on top is the homogeneity index.

Figure 7: $D_1(1)$ texture log of Fig. 5 with homogeneity correction at level 0.05.


6  Behavior of the $D_n(q)$'s for a transition between a binomial measure and a
non self-similar measure

In this section, we show that the GMA representation can be used to detect and quantify
departures from self-similarity, while providing an analogue of the generalized dimension
even if the measure is not self-similar. We examine the GMA texture logs for a transition
from a random binomial measure with $w_1$ = 0.3 to a non self-similar measure. The
latter is constructed by making the factor $w_1$ vary with the step index n (with the
multiplicative process described in Section 5). Our non self-similar measure was
obtained with the following values of $w_1(n)$ (n = 1, 2, ..., 10): {0.32, 0.27, 0.28, 0.38, 0.25,
0.25, 0.42, 0.40, 0.11, 0.11}. The total measure is obtained by joining this
measure to the binomial measure (Fig. 8).

The two measures are distinguished clearly by $D_1(1)$ (Fig. 9) and the homogeneity
correction again results in a sharp detection of the boundary (Fig. 10). The $D_2(1)$ texture
log (Fig. 11) shows a marked transition between the two measures. Indeed, $|D_2(1)|$ is
small on the side of the binomial measure (left), and then becomes larger on the side of
the non self-similar measure (right). A larger $|D_2(1)|$ indicates a larger deviation of $\varphi_q(x)$
from a straight line, to be expected for a non multifractal.

Figure 11: $D_2(1)$ without any homogeneity correction.


7  Conclusions 

For  signals  that  are  nearly  piecewise  stationary,  our  method  for  generating  textural 
transforms  results  in  a  sharper  detection  of  the  boundary  between  distinct  segments. 
Indeed,  the  uncertainty  on  the  location  of  the  transition  is  much  smaller  than  the  window 
size.  These  improvements  have  been  obtained  in  the  context  of  approximately  piecewise 
stationary  signals.  The  special  cases  examined  were  the  local  average  of  a  white  noise  and 
the  generalized  dimensions  Dn(q)  of  adjacent  binomial  measures.  A  better  assessment  of 
the  algorithm  proposed  should  involve  segmentation  tests  in  the  presence  of  different 
types  of  noise,  e.g.  colored  noise,  long-tailed  distribution  noise,  as  well  as  real  signals 
such  as  1-d  cuts  through  images. 

It is emphasized that our method is quite general and can be used for any statistical
parameter. Applied to the GMA representation, we demonstrated that the generalized
dimensions $D_n(q)$ can clearly detect departures from multifractality^d in a signal, while
providing the analogue of the classical generalized dimension $D(q)$ for multifractals.


We study analytically the kinetics of growth of parallel needles, using a conformal
transformation to set up the iterative nonlinear equations. This allows us to build the discrete
Fokker-Planck equation for the probability of finding at time t a given distribution of
needle lengths. We consider here two specific cases: we find the exact Fokker-Planck
equation for pairs of needles and its solutions, and the linear behavior of a set of n
needles with equal initial lengths. The corresponding Fokker-Planck equations show the
short-wavelength Mullins-Sekerka instability of these parallel needles, and the possible
structure of the screening leading to the scale invariance of the model.


1  Introduction 


Laplacian growth is a ubiquitous process of considerable physical interest: the start
of dendritic growth of crystalline structures during solidification when the diffusion
length is large, electrodeposition, viscous fingering, dielectric breakdown and the growth
of bacterial colonies all belong to Laplacian growth and can be described by Diffusion
Limited Aggregation (DLA) models1. Related physical phenomena include
mechanical cracking, such as mud cracking during drying and crack formation in pieces of
material under strain. The Laplacian field takes its origin in the diffusion of material in
front of the growing structures (particles which aggregate, nutrient in the growth
of bacterial colonies), or in the electric field as in electrodeposition or in dielectric
breakdown. In most situations, branching occurs during the growth, leading to
DLA structures. In spite of the apparent simplicity of the physical laws governing
these systems, and the very simple simulation models containing the essence of the
physics, our analytic understanding of DLA remains very unsatisfactory. Most
studies have used extended numerical simulations to extract the scaling laws
governing these systems. Therefore a more modest but useful approach consists in a
better understanding of DLA growth in the absence of branching2,3. Again, in this
simpler problem of growing needles, we can consider radial or parallel needles, as
well as reflection (model R) or absorption (model A 4) of particles (see for instance
Krug5 for a review). In this paper we will concentrate our attention on systems
of parallel absorbing needles which, we think, are of more fundamental interest. In
two dimensions, conformal mapping allows analytical results to be obtained, as shown by
Derrida and Hakim6 for radial needles.

An important question in such systems is the competition between the
needles. The instability of an initial distribution of needles of equal length, as well as
unstable modes of a growing flat interface in dendritic growth, is known as the
"Mullins-Sekerka" instability7. In a recent experiment, Losert et al.8 showed the
spatial period-doubling instability of dendritic arrays in directional solidification
as suggested theoretically by Warren and Langer9. It will be the purpose of this
preliminary paper to establish the basic Fokker-Planck equation for the growth of
n parallel needles, in the linear regime, which leads to the Mullins-Sekerka
behavior. We will indicate in a few words, on the basis of the exact solution for pairs of
needles, what we can expect for the general behavior of n parallel needles.

To  study  analytically  the  needle  models  of  Laplacian  growth  in  2-dimensional 
systems,  the  most  convenient  approach  is  to  use  conformal  transformations.  We  will 
only give in Sec. 2 the main arguments to set up these transformations, and the
reader is referred to Shraiman and Bensimon10, Szep and Lugosi11, Peterson and
Furry12, Kurtze13, and Derrida and Hakim6. For the dynamics, we will follow here
Derrida et al.6,14. In Sec. 3 we write the closed set of equations which describes
the  time  evolution  in  a  compact  matrix  notation.  In  Sec.  4,  we  solve  exactly  the 
general  two-needles  case,  and  establish  the  associated  Fokker-Planck  equation.  The 
case  of  n  growing  parallel  needles  is  examined  in  Sec.  5,  and  the  linearized  equations 
are  derived  in  Sec.  6.  These  Fokker-Planck  equations  reveal  the  short  wavelength 
instability  leading  to  period-doubling. 

Two ingredients are needed for the Laplacian growth: on the one hand the
Laplacian behavior determines the long range interaction characteristic of DLA
growth, but on the other hand the inherent Mullins-Sekerka instability could not
operate without the presence of local noise. This noise can be introduced6,14 in
the initial state. We chose here to consider a discrete model with diffusing particles
of finite size $\delta\ell$ sticking to the needles at discrete time intervals.


2  The  basic  conformal  transformations 

The classical way to parametrize in a convenient manner Laplace’s equation with
a zero potential boundary condition on a set of n parallel needles is to introduce
first a transformation which maps the unit circle in the complex plane $z$ onto an
n-branched star in the complex plane $\omega$:

$$\omega = f(z) = A\,z \prod_{j=0}^{n-1} \left(1 - e^{i\theta_j}/z\right)^{\alpha_j} \quad \text{with} \quad \sum_{j=0}^{n-1} \alpha_j = 2. \qquad (1)$$

In this transformation, $\pi\alpha_j$ is the angle between two successive needles $\{j-1, j\}$,
the sum of the angles being $2\pi$ (Eq. (1)). The angles $\theta_j$ fix the lengths $\ell_i$ of the
needles, and $A$ is a parameter which will be determined below.

A second transformation maps the star in the complex plane $\omega$ into a set of
parallel needles in a complex plane $\Omega$:

$$\Omega = \log \omega. \qquad (2)$$

In the plane $\Omega$, the Laplacian field is $\mathrm{Re}\,[\log(f^{-1}(\exp \Omega))]$.

The tip positions in plane $z$ are parametrized by the angles $\phi_i$:

$$z_i = \exp(i\phi_i), \quad 0 \le i \le n-1, \qquad (3)$$

and the lengths of the needles are the modulus of $\log(f(z))$ at the points $z = z_i$:

$$\ell_i = \log(4A) + \sum_{j=0}^{n-1} \alpha_j \log |\sin((\phi_i - \theta_j)/2)|, \quad \forall\; 0 \le i \le n-1. \qquad (4)$$

Then n additional constraints must be imposed on the angles $\phi_i$ and $\theta_j$, which take
into account the fact that the needle tips maximize $|f(z)|$ at $z = z_i$:

$$\sum_{j=0}^{n-1} \alpha_j \cot((\phi_i - \theta_j)/2) = 0, \quad \forall\; 0 \le i \le n-1. \qquad (5)$$
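Equations (1), (4) and (5) can be checked numerically on the simplest configuration, a symmetric star of n equal needles. The sketch below is an illustration only: the function names are hypothetical, and the symmetric choice $\theta_j = 2\pi j/n$, $\phi_i = \theta_i + \pi/n$, $\alpha_j = 2/n$, $A = 1$ is an assumption made for the test, not a configuration taken from the paper.

```python
import cmath
import math


def tip_lengths(amplitude, alpha, theta, phi):
    """Needle lengths from eq. (4)."""
    return [math.log(4.0 * amplitude)
            + sum(a * math.log(abs(math.sin((p - t) / 2.0)))
                  for a, t in zip(alpha, theta))
            for p in phi]


def log_modulus_f(z, amplitude, alpha, theta):
    """log|f(z)| evaluated directly from the product form of eq. (1)."""
    value = math.log(amplitude) + math.log(abs(z))
    for a, t in zip(alpha, theta):
        value += a * math.log(abs(1.0 - cmath.exp(1j * t) / z))
    return value


def tip_condition(alpha, theta, phi):
    """Left-hand side of eq. (5) for every tip; should vanish."""
    return [sum(a / math.tan((p - t) / 2.0) for a, t in zip(alpha, theta))
            for p in phi]


n = 4
alpha = [2.0 / n] * n
theta = [2.0 * math.pi * j / n for j in range(n)]
phi = [t + math.pi / n for t in theta]

lengths = tip_lengths(1.0, alpha, theta, phi)
direct = [log_modulus_f(cmath.exp(1j * p), 1.0, alpha, theta) for p in phi]
residuals = tip_condition(alpha, theta, phi)
```

In this symmetric case $f(z) = A z (1 - z^{-n})^{2/n}$, so every tip has $\log|f| = \log A + (2/n)\log 2$; the closed form (4) and the direct evaluation of (1) agree with this value, and the tip condition (5) is satisfied.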

Now  for  the  kinetics,  the  growth  rate  is  supposed  to  be  proportional  to  the  potential 
gradient  along  the  needles.  In  addition,  the  growth  is  supposed  to  be  restricted  to 
the  tips.  Therefore,  following  refs.6,14,  where  only  the  tips  grow,  while  the  needles 
remain  at  zero  potential  ( model  A ),  the  growth  rate  of  the  needles  is: 


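$$\frac{d\ell_i}{dt} \propto \left| \sum_{j=0}^{n-1} \alpha_j \left(1 + \cot^2((\phi_i - \theta_j)/2)\right) \right|^{-1/2}, \quad \forall\; 0 \le i \le n-1. \qquad (6)$$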

This  equation  will  be  used  to  determine  the  growth  probability  of  the  needles. 

Now  we  have  all  the  ingredients  necessary  to  build  the  recursion  relations  to 
describe  the  evolution  of  our  system. 


3  Matrix  form  of  the  basic  equations 

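It is first very convenient to introduce the matrix $\mathbf{C} = \{c_{ij}\}_{0 \le i,j \le n-1}$ where:

$$c_{ij} = \cot((\phi_i - \theta_j)/2) \qquad (7)$$

and condition (5) may be simply written ($\vec{\alpha}$ is the vector $\{\alpha_i\}_{0 \le i \le n-1}$):

$$\mathbf{C}.\vec{\alpha} = \vec{0}. \qquad (8)$$

In a similar way, we can define $\mathbf{D} = \{d_{ij}\}_{0 \le i,j \le n-1}$ where $d_{ij} = 1 + c_{ij}^2$. Then
condition (6) is:

$$d\ell_i/dt \propto \{\mathbf{D}.\vec{\alpha}\}_i^{-1/2}. \qquad (9)$$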

3.1  Infinitesimal  growth 

We  consider  now  that  between  t  and  t  A-  St,  a  particle  of  size  SI  sticks  on  needle 
i.  This  increase  of  length  is  supposed  much  smaller  than  the  distance  between 
needles,  n ctj.  The  equations  can  be  linearized:  for  a  variation  of  the  length  li 
corresponds  the  variations  6(pi  of  (pi  and  80i  of  Oi.  From  (4)  and  (5)  we  find: 

Sli  =  SA/A  -  £;>/2)  cot  ({4>i  -  e3)/2)  Sdj  .  (10) 

As  there  are  (2n  +  1)  variables  in  our  system  of  2 n  equations  we  have  to  impose 
a  supplementary  constraint.  One  choice  has  been  to  fix  one  angle  (for  example 
0O  =  0,  ref.  14).  Here  we  will  make  the  mean  of  the  I{  to  remain  zero  (this  choice 
does  not  single  out  any  angle).  This  fixes  A  via  equation  (4).  Using  (10) ,  and 

£*<%  =  o, 


(SA/A)  1  =  1/(2 n)  O.C.ati  , 


(11) 
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where  a6$  =  {ai%}0<i<n_ v  Here,  1  is  the  unit  matrix  and  O  =  {l}jj  a  matrix 
filled  of  l’s.  Replacing  (11)  into  (10)  gives  6i{  as  functions  of  60 j, 

St  =  -1/2  (1  -  (1/n)  O)  .C.^6$  ,  (12) 

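where $(1 - (1/n)\, O)$ is a projector on the subspace orthogonal to the completely symmetrical component (for a vector $\vec{V}$ it is $V_0 = (1/n) \sum_i V_i$). The choice

$$\delta\vec{\ell}[j] = {}^t\{-\delta\ell/n, \ldots, (\delta\ell - \delta\ell/n)_j, \ldots, -\delta\ell/n\},$$

corresponding to adding a particle $\delta\ell$ on needle $j$, ensures the conservation of the mean length (equal to zero). The superscript $t$ stands for transposition (column vector). Eq. (12) allows us to express the variations $\delta\theta_j$ as functions of the $\delta\ell_i$. We can also relate $\delta\phi_i$ to $\delta\theta_j$; Eq. (5) gives:

$$0 = \sum_{j=0}^{n-1} \alpha_j (\delta\phi_i - \delta\theta_j) \big(1 + \cot^2((\phi_i - \theta_j)/2)\big). \tag{13}$$

This makes it possible to calculate the variations $\delta\phi_i$:

$$\delta\phi_i = (D \cdot \overrightarrow{\alpha\delta\theta})_i / (D \cdot \vec{\alpha})_i. \tag{14}$$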

Now, Eq. (9), which provides the increase of needle length per unit time in the diffusion field, enables us to introduce the growth probability $p_i$ for a particle of size $\delta\ell$ to stick on the tip of needle $i$ during a time $\delta t$. Let $D \cdot \vec{\alpha} = \vec{\eta}$; then

$$p_i = \frac{d\ell_i/dt}{\sum_j d\ell_j/dt} = \frac{\eta_i^{-1/2}}{\sum_j \eta_j^{-1/2}}. \tag{15}$$

The set of equations (12), (14) and (15), together with (8), defines the successive iterations of the growth: if at time $t$ all the parameters are known, (15) determines the growth probabilities on the tips; knowing the $\delta\ell_i$, we calculate $\delta\theta$ via (12), then $\delta\phi$ via (14), and then the perturbations $\delta C$ of $C$ and $\delta D$ of $D$. To calculate $\delta\vec{\theta}$ as a function of $\delta\vec{\ell}$ we have to invert Eq. (12). Unfortunately, relation (8) shows that $\vec{\alpha}$ is an eigenvector of $C$ for the eigenvalue 0, and therefore $C$ cannot be inverted directly.
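As a numerical illustration of this singularity, the following minimal sketch builds $C$ and $D$ for a uniform comb and checks that $C \cdot \vec{\alpha}$ vanishes. It assumes $\alpha_i = 1$ and the regularly spaced angles $\phi_i = 2\pi i/n$ and $\theta_j = \pi(2j - 1)/n$ of Eq. (47); the function names are hypothetical and are not taken from the paper.

```python
import math

def comb_matrices(n):
    """Return (C, D) for a uniform comb (assumption: regularly spaced angles)."""
    phi = [2 * math.pi * i / n for i in range(n)]
    theta = [math.pi * (2 * j - 1) / n for j in range(n)]
    # c_ij = cot((phi_i - theta_j)/2), Eq. (7)
    C = [[1.0 / math.tan((phi[i] - theta[j]) / 2) for j in range(n)] for i in range(n)]
    # d_ij = 1 + c_ij^2
    D = [[1.0 + c * c for c in row] for row in C]
    return C, D

def matvec(M, v):
    """Plain matrix-vector product."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def growth_probabilities(D, alpha):
    """p_i proportional to eta_i^(-1/2) with eta = D.alpha, Eq. (15)."""
    eta = matvec(D, alpha)
    weights = [e ** -0.5 for e in eta]
    total = sum(weights)
    return [w / total for w in weights]

n = 8
alpha = [1.0] * n                    # assumption: alpha_i = 1 for every needle
C, D = comb_matrices(n)
residual = matvec(C, alpha)          # should vanish: alpha is a null vector of C
p = growth_probabilities(D, alpha)   # should be uniform by symmetry
```

For such a uniform comb the residual is zero up to rounding, which is the eigenvalue-0 statement above, and the growth probabilities all equal $1/n$.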

3.2  Invariants 

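Under a global rotation $\delta\Omega$ of the angles $\theta$ and $\phi$, the configuration of the needles, and thus all the preceding expressions, remain invariant. For example, $C$ is left unchanged, and from (12) and (8) $\delta\vec{\ell}$ also remains invariant:

$$\delta\vec{\ell} = -\frac{1}{2} \big(1 - (1/n)\, O\big) \cdot C \cdot \big(\overrightarrow{\alpha\delta\theta} + \delta\Omega\, \vec{\alpha}\big) = -\frac{1}{2} \big(C - (1/n)\, O \cdot C\big) \cdot \overrightarrow{\alpha\delta\theta}. \tag{16}$$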


4  The  two-needles  problem 

We are now ready to solve the two-needles case. Here $n = 2$, but the needles are periodically repeated due to the cyclic boundary conditions imposed, as shown in Fig. 1. With the zero mean length constraint, we write

$$\tilde{\ell}_0 = (\ell_0 + \ell_1)/2 = 0; \quad \tilde{\ell}_1 = (\ell_0 - \ell_1)/2.$$

Figure 1. a) Set of equally distant pairs of needles ($\alpha_0 = \alpha_1 = 1$); b) General case.


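The problem is completely defined by the knowledge of the probability $P(\tilde{\ell}_1, t)$ to find at time $t$ a length gap $\tilde{\ell}_1$. The general form of $C$, due to condition (8), and of $D$, with elements $d_{ij} = 1 + c_{ij}^2$, is expressed with two unknown functions $c_0(\tilde{\ell}_1)$ and $c_1(\tilde{\ell}_1)$:

$$C = \begin{pmatrix} \alpha_1 c_0 & -\alpha_0 c_0 \\ -\alpha_1 c_1 & \alpha_0 c_1 \end{pmatrix}, \quad D = \begin{pmatrix} 1 + (\alpha_1 c_0)^2 & 1 + (\alpha_0 c_0)^2 \\ 1 + (\alpha_1 c_1)^2 & 1 + (\alpha_0 c_1)^2 \end{pmatrix}; \tag{17}$$

the growth probabilities are then found from (15):

$$\begin{pmatrix} p_0(\tilde{\ell}_1) \\ p_1(\tilde{\ell}_1) \end{pmatrix} = \frac{1}{\sqrt{1 + \alpha_0\alpha_1 c_0^2} + \sqrt{1 + \alpha_0\alpha_1 c_1^2}} \begin{pmatrix} \sqrt{1 + \alpha_0\alpha_1 c_1^2} \\ \sqrt{1 + \alpha_0\alpha_1 c_0^2} \end{pmatrix}. \tag{18}$$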


4.1  Initial  conditions 

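Fixing arbitrarily $\theta_0(0) = 0$, the initial conditions ($t = 0$) are:

$$\vec{\ell}(0) = \begin{pmatrix} 0 \\ 0 \end{pmatrix}; \quad \begin{pmatrix} \theta_0(0) \\ \theta_1(0) \end{pmatrix} = \begin{pmatrix} 0 \\ \pi \end{pmatrix}; \quad \begin{pmatrix} \phi_0(0) \\ \phi_1(0) \end{pmatrix} = \begin{pmatrix} 2\,\mathrm{arccot}\sqrt{\alpha_1/\alpha_0} \\ -2\,\mathrm{arccot}\sqrt{\alpha_1/\alpha_0} \end{pmatrix} \tag{19}$$

and, for the parameter $A$,

$$A = \alpha_0^{-\alpha_0/2}\, \alpha_1^{-\alpha_1/2} / 2. \tag{20}$$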


From this we can calculate $C(0)$ and $D(0)$.

4.2 Equations satisfied by $c_0(\tilde{\ell}_1)$ and $c_1(\tilde{\ell}_1)$

From the above, we deduce that the functions $c_0(\tilde{\ell}_1)$ and $c_1(\tilde{\ell}_1)$ satisfy the differential equation:

$$\frac{dc_0}{d\tilde{\ell}_1} = -\frac{1}{\alpha_0\alpha_1}\, \frac{(1 + \alpha_0^2 c_0^2)(1 + \alpha_1^2 c_0^2)}{(1 + \alpha_0\alpha_1 c_0^2)(c_0 + c_1)} \tag{21}$$


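and, permuting $c_0$ and $c_1$, a similar equation for $c_1(\tilde{\ell}_1)$. The solutions of these equations can be obtained parametrically:

$$c_0(\tilde{\ell}_1) = 1/(u\sqrt{\alpha_0\alpha_1}); \quad c_1(\tilde{\ell}_1) = u/\sqrt{\alpha_0\alpha_1} \tag{22}$$

with the relation

$$\exp(\tilde{\ell}_1) = u \left(\frac{\alpha_0 + \alpha_1 u^2}{\alpha_1 + \alpha_0 u^2}\right)^{(\alpha_0 - \alpha_1)/4}. \tag{23}$$

The growth probabilities are then, from (18),

$$p_0(\tilde{\ell}_1) = u/(1 + u); \quad p_1(\tilde{\ell}_1) = 1/(1 + u). \tag{24}$$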
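The parametric solution can be checked numerically. The sketch below is only an illustration under stated assumptions: it takes the normalization $\alpha_0 + \alpha_1 = 2$ (consistent with $\alpha_0 = \alpha_1 = 1$ for equidistant needles), reads Eq. (23) as $\exp(\tilde{\ell}_1) = u\,[(\alpha_0 + \alpha_1 u^2)/(\alpha_1 + \alpha_0 u^2)]^{(\alpha_0 - \alpha_1)/4}$, and reads Eq. (21) as $dc_0/d\tilde{\ell}_1 = -(1 + \alpha_0^2 c_0^2)(1 + \alpha_1^2 c_0^2)/[\alpha_0\alpha_1 (1 + \alpha_0\alpha_1 c_0^2)(c_0 + c_1)]$. All function names are hypothetical.

```python
import math

def ell(u, a0, a1):
    """Length gap as a function of the parameter u (assumed reading of Eq. (23))."""
    return math.log(u) + 0.25 * (a0 - a1) * math.log((a0 + a1 * u * u) / (a1 + a0 * u * u))

def c0(u, a0, a1):
    return 1.0 / (u * math.sqrt(a0 * a1))   # Eq. (22)

def c1(u, a0, a1):
    return u / math.sqrt(a0 * a1)           # Eq. (22)

def probabilities(u, a0, a1):
    """p_0 and p_1 from Eq. (15), with eta = D.alpha and D as in Eq. (17)."""
    k0, k1 = c0(u, a0, a1), c1(u, a0, a1)
    eta0 = a0 * (1 + (a1 * k0) ** 2) + a1 * (1 + (a0 * k0) ** 2)
    eta1 = a0 * (1 + (a1 * k1) ** 2) + a1 * (1 + (a0 * k1) ** 2)
    w0, w1 = eta0 ** -0.5, eta1 ** -0.5
    return w0 / (w0 + w1), w1 / (w0 + w1)

def ode_rhs(u, a0, a1):
    """Right-hand side of the assumed reading of Eq. (21)."""
    k0, k1 = c0(u, a0, a1), c1(u, a0, a1)
    num = (1 + (a0 * k0) ** 2) * (1 + (a1 * k0) ** 2)
    den = a0 * a1 * (1 + a0 * a1 * k0 ** 2) * (k0 + k1)
    return -num / den

def ode_lhs(u, a0, a1, h=1e-5):
    """dc_0/d(ell) by central differences along the parametric curve."""
    return (c0(u + h, a0, a1) - c0(u - h, a0, a1)) / (ell(u + h, a0, a1) - ell(u - h, a0, a1))

a0, a1, u = 1.3, 0.7, 1.7      # illustrative values with a0 + a1 = 2
p0, p1 = probabilities(u, a0, a1)
lhs, rhs = ode_lhs(u, a0, a1), ode_rhs(u, a0, a1)
```

With these inputs `p0` and `p1` agree with $u/(1+u)$ and $1/(1+u)$ of Eq. (24), and `lhs` matches `rhs` to finite-difference accuracy.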


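4.3 Discrete Fokker-Planck equation

The Fokker-Planck equation of a pair of needles is, with $\delta\tilde{\ell}_1 = \delta\ell/2$,

$$P(\tilde{\ell}_1, t + \delta t) = p_0(\tilde{\ell}_1 - \delta\tilde{\ell}_1)\, P(\tilde{\ell}_1 - \delta\tilde{\ell}_1, t) + p_1(\tilde{\ell}_1 + \delta\tilde{\ell}_1)\, P(\tilde{\ell}_1 + \delta\tilde{\ell}_1, t). \tag{25}$$

The evolution Eq. (25) can be expanded to second order in $\delta\ell$,

$$\frac{\partial P(\tilde{\ell}_1, t)}{\partial t} = D\, \frac{\partial^2 P(\tilde{\ell}_1, t)}{\partial \tilde{\ell}_1^2} - v\, \frac{\partial}{\partial \tilde{\ell}_1} \big(P(\tilde{\ell}_1, t)\, U(\tilde{\ell}_1)\big), \tag{26}$$

where the following constants have been introduced,

$$v = \delta\ell/(2\delta t) \quad \text{and} \quad D = \delta\ell^2/(8\delta t), \tag{27}$$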

$v$ is the relative growth velocity of needle 0, $D$ is the "diffusion" coefficient of the sticking between the two needles, and $\tilde{\ell}_1 = (\ell_0 - \ell_1)/2$. At short times the particles stick at random on both needle tips, up to the moment when one needle gives way to the other. The function $U$ characterizes the screening effect,

$$U(\tilde{\ell}_1) = \big(u(\tilde{\ell}_1) - 1\big) / \big(u(\tilde{\ell}_1) + 1\big), \tag{28}$$

where $u(\tilde{\ell}_1)$ is implicitly defined by (23). The graph of $U(\tilde{\ell}_1)$ is shown in Fig. 2a.

In the linear region, when $\tilde{\ell}_1 \to 0$,

$$U(\tilde{\ell}_1) \simeq \tilde{\ell}_1/(2\alpha_0\alpha_1) \tag{29}$$

and the diffusion dominates, while when $\tilde{\ell}_1 \to \pm\infty$, $U(\tilde{\ell}_1) \to \pm 1$, and the longest needle grows at velocity $v$. Eq. (26) represents the diffusion of a "particle" with coordinate $\tilde{\ell}_1$ in a potential,

$$V(\tilde{\ell}_1) = -v \int U(\tilde{\ell}_1)\, d\tilde{\ell}_1. \tag{30}$$

Figure 2. a) Graph of $U(\tilde{\ell}_1)$ for various values of $\alpha_0$; b) Effective potential $V(\tilde{\ell}_1)$ when $\alpha_0 = \alpha_1$.

Figure 3. Distribution of probability $P(\tilde{\ell}_1, t)$ at times $t/\delta t = 20, 30, 120$.

If the needles are equidistant, $\alpha_0 = \alpha_1 = 1$ and $u(\tilde{\ell}_1) = \exp\tilde{\ell}_1$, so that we have explicitly $U(\tilde{\ell}_1) = \tanh(\tilde{\ell}_1/2)$. The potential (shown in Fig. 2b) is then explicitly

$$V(\tilde{\ell}_1) = -2v \log\big(\cosh(\tilde{\ell}_1/2)\big). \tag{31}$$

The time evolution of the solution of Eq. (26) for needles of equal length at $t = 0$ ($\alpha_0 = \alpha_1 = 1$) is shown in Fig. 3.
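The discrete evolution (25) is simple to iterate directly. The following minimal sketch does so for equidistant needles, using $p_0 = u/(1 + u)$ with $u = \exp\tilde{\ell}_1$. The particle size `dl = 0.5`, the grid layout and the function names are illustrative assumptions, not values from the paper.

```python
import math

def p0(x):
    """Sticking probability on needle 0 for equidistant needles: u/(1+u), u = exp(x)."""
    return 1.0 / (1.0 + math.exp(-x))

def evolve(steps, dl=0.5):
    """Iterate Eq. (25) on a grid of spacing dl/2, starting from equal lengths."""
    a = dl / 2.0              # change of the length gap per sticking event
    m = steps + 1             # half-width of the grid: the gap moves one site per step
    P = [0.0] * (2 * m + 1)
    P[m] = 1.0                # all probability at zero gap at t = 0
    for _ in range(steps):
        new = [0.0] * len(P)
        for k, prob in enumerate(P):
            if prob == 0.0:
                continue
            up = p0((k - m) * a)
            new[k + 1] += up * prob           # particle sticks on needle 0
            new[k - 1] += (1.0 - up) * prob   # particle sticks on needle 1
        P = new
    xs = [(k - m) * a for k in range(len(P))]
    return xs, P

xs, P = evolve(120)
```

The scheme pushes probability forward from each site, which is equivalent to Eq. (25) read as a gather. The total probability stays equal to one, the mean gap stays at zero by symmetry, and at late times the distribution is depleted around zero gap because one needle screens the other.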


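4.4 Fluctuations of the difference between the branch lengths

The average of the difference between the branch lengths is zero by symmetry,

$$\langle \tilde{\ell}_1 \rangle (t) = \int_{-\infty}^{+\infty} \tilde{\ell}_1\, P(\tilde{\ell}_1, t)\, d\tilde{\ell}_1 = 0 \tag{32}$$

because $\tilde{\ell}_1 P(\tilde{\ell}_1, t)$ is odd. But the mean square fluctuation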

(fi  -  (l. i))2}(*)  =  (??)(*)  = 


(33)

5 Mullins-Sekerka instability of a comb of needles

Let  us  consider  now  the  case  of  n  equidistant  needles  for  which  ~ct  =  ^ 

In  this  case  it  is  convenient  to  use  the  Fourier  transform  of  the  above  equations. 

5.1 Reduction of the vector space dimension

We introduce the Fourier transform of an arbitrary $n$-vector $\vec{V}$:

$$\tilde{V} = F \cdot \vec{V} \quad \text{with} \quad F = \{(1/n) \exp(2\pi i\, jh/n)\}_{0 \le j,h \le n-1}. \tag{38}$$

To invert Eq. (12), we have to discard the completely symmetrical component, which makes $C$ singular. The working space will now have $(n - 1)$ components, and we will use an index $r$ to specify its reduced vectors and the rectangular matrix operators relating reduced $(n - 1)$-vectors to $n$-vectors. Thus, we introduce the rectangular Fourier matrix:

$$F_r = \{F_{kh}\}_{1 \le k \le n-1,\; 0 \le h \le n-1}. \tag{39}$$

The corresponding distribution of modes is

$$\tilde{L}_r = F_r \cdot \vec{L}, \tag{40}$$

a particular mode $k$ being given by

$$\tilde{\ell}_k = (1/n) \sum_{j=0}^{n-1} \ell_j \exp(2\pi i\, jk/n). \tag{41}$$

For an increase of length $\delta\ell$ on needle $j$,

$$\delta\tilde{L}_r[j] = F_r \cdot \delta\vec{\ell}[j] = (\delta\ell/n)\, \{\exp(2\pi i\, jk/n)\}_{1 \le k \le n-1}. \tag{42}$$

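The reduced Fourier matrices lead to the following products: $F_r \cdot (F^{-1})_r = 1_r$ and $(F^{-1})_r \cdot F_r = 1 - (1/n)\, O$. In particular, this allows us to write (12) as

$$\delta\vec{\ell} = -\frac{1}{n}\, (F^{-1})_r \cdot F_r \cdot C \cdot \delta\vec{\theta}. \tag{43}$$

It is now easy to invert Eq. (43). Let $C_r = F_r \cdot C \cdot (F^{-1})_r$; then

$$\delta\vec{\theta} = -n\, (F^{-1})_r \cdot (C_r)^{-1} \cdot F_r \cdot \delta\vec{\ell}. \tag{44}$$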
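The two products of reduced Fourier matrices quoted above can be verified numerically. The sketch below is a minimal check, assuming $F = \{(1/n)\exp(2\pi i\, kh/n)\}$ so that $F^{-1} = \{\exp(-2\pi i\, hk/n)\}$, with $F_r$ keeping the rows $k = 1, \ldots, n-1$ and $(F^{-1})_r$ the matching columns. The helper names and the values of `dl` and `needle` are hypothetical.

```python
import cmath

def reduced_fourier(n):
    """Return (F_r, (F^-1)_r) for F = {(1/n) exp(2*pi*i*k*h/n)}."""
    w = 2j * cmath.pi / n
    F_r = [[cmath.exp(w * k * h) / n for h in range(n)] for k in range(1, n)]
    Finv_r = [[cmath.exp(-w * h * k) for k in range(1, n)] for h in range(n)]
    return F_r, Finv_r

def matmul(A, B):
    """Plain matrix product of two lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

n = 6
F_r, Finv_r = reduced_fourier(n)
left = matmul(F_r, Finv_r)     # (n-1) x (n-1): expected to be the identity 1_r
right = matmul(Finv_r, F_r)    # n x n: expected to be 1 - (1/n) O

# Length increment for one particle of size dl on a given needle, with zero mean
dl, needle = 0.1, 2
delta = [-dl / n] * n
delta[needle] += dl
modes = [sum(f * d for f, d in zip(row, delta)) for row in F_r]   # left side of Eq. (42)
```

Here `modes[k - 1]` reproduces $(\delta\ell/n)\exp(2\pi i\, jk/n)$ of Eq. (42) for the chosen needle $j$, because the symmetric part of the increment is annihilated by the rows $k \ge 1$.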


5.2  Evolution  of  the  probabilities 

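The probability $P(\vec{L}, t)$ to find the distribution of needle lengths $\vec{L}$ at time $t$ is now replaced by the probability $P(\tilde{L}_r, t)$ to find a given Fourier mode, and we have to build the Fokker-Planck equation which relates $P(\tilde{L}_r, t)$ and $P(\tilde{L}_r + \delta\tilde{L}_r, t + \delta t)$.

With probability $p_i(\tilde{L}_r)$ we add at time $t$ a particle of size $\delta\ell$ on needle $i$. The corresponding change of the lengths $\delta\vec{\ell}[i]$ induces a change in the angles $\theta$ and $\phi$,

$$\delta\vec{\theta}[i] = -n\, (F^{-1})_r \cdot (C_r)^{-1} \cdot F_r \cdot \delta\vec{\ell}[i]; \quad \delta\vec{\phi}[i] = \big\{ (D \cdot \overrightarrow{\alpha\delta\theta}[i])_j\, (D \cdot \vec{\alpha})_j^{-1} \big\}, \tag{45}$$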


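from which we determine, by differentiation of Eq. (15), the value of $p_i(t + \delta t)$:

$$p_i(t + \delta t) = p_i(t) - \frac{\delta\eta_i}{2\eta_i^{3/2} \sum_h \eta_h^{-1/2}} + \eta_i^{-1/2} \Big(\sum_h \eta_h^{-1/2}\Big)^{-2} \sum_h \frac{\delta\eta_h}{2\eta_h^{3/2}}. \tag{46}$$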


5.3  Initial  conditions 

In the present case, for which the needles have an equal length at $t = 0$, the initial angles $\phi_i$ and $\theta_j$ ($0 \le i, j \le n - 1$) are also regularly spaced:

$$\phi_j(0) = 2\pi j/n \quad \text{and} \quad \theta_j(0) = \pi(2j - 1)/n. \tag{47}$$


5.4 Sticking probabilities in the linearized regime

From expression (45) we can calculate $\delta\vec{\theta}(0)$ and $\delta\vec{\phi}(0)$ corresponding to a variation $\delta\ell_j(0)$, for instance by adding a particle on needle $h$. Thus, if a particle of size $\delta\ell$ is stuck on needle $h$ of a uniform comb, then the sticking probability of the next
particle is, using Eq. (46) (with $p_i(t = 0) = 1/n$ and $\eta_i(0) = n^2$),

$$p_i = \frac{1}{n} - \frac{\delta\eta_i(0)}{2n^3} + \frac{1}{2n^4} \sum_h \delta\eta_h(0). \tag{48}$$


The sticking probability $p_i[h]$ of a new particle on needle $i$, while a particle is already stuck on needle $h$ ($\delta_{ih}$ is the Kronecker symbol), is then


P,W  =  ^  +  ~ (1  -  fa))  .  (49) 
Figure 4. Screening of the sticking probability by the longer needle $h$.


As expected, this result shows that the other needles are screened by the longer needle $h$. This is shown in Fig. 4.


6 Fokker-Planck equation of the linearized comb

Let $P_{lin}(\vec{L}, t)$ be the linearized probability to find a set of needles of size $\vec{L} = {}^t\{\ell_0, \ldots, \ell_{n-1}\}$; the linearization supposes that the lengths $\ell_i$ are not too different, so that no needle can be completely screened (see Sec. 4). In the linearized approximation, the sticking probabilities on $\vec{L}$ are then the superposition of the individual probabilities $p_j[\ell_h]$, where $p_j[\ell_h]$ is deduced from $p_j[h]$ by replacing $\delta\ell$ by $\ell_h$ (superposition rule). If, between $t$ and $t + \delta t$, a particle $\delta\ell$ is added on needle $h$ and if $\vec{L}'[h] = \vec{L} - \delta\vec{\ell}[h]$, then


yun(L,t +st)  =  ^2 yym  Pun(j? [k\,t), 


h—0  j= 0 


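with $\vec{L}'[h] = {}^t\{\ell_0 + \delta\ell/n,\; \ell_1 + \delta\ell/n,\; \ldots,\; \ell_h - \delta\ell + \delta\ell/n,\; \ldots,\; \ell_{n-1} + \delta\ell/n\}$.

We recover the linearized form of the particular case $n = 2$ of (25):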

iW4,t+ft)  =  (p0[ft]  +  -  Wl,t)  +  +P1[^i])fl<n(2i  +6lu  t) 

(51) 

as  to  first  order, 

PO  (?1  -  «i)  *  (1  +  (4  -  4  -  Si) /A)  /2  =  P0[^']  +  p°[4] 

Pi  pi  +  «i)  -  (1-  (4  -  4  -  M)/4)  /2  =  p\Q  +  p1^].  (52) 

In  the  general  linearized  case  we  obtain  from  Eqs.  (49,50), 

Plin(  L  A  +  8t)  =  £  ^L=0  4n(?W^)  + 

2^2  S/i=0  (^/i  _  cs°2  ((ft  “  jW71)  Plin(X 7 [ft],  *) 

with  Z^ft]  =  Z{.-  <5Lr[h]  defined  by  Eq.  (42) . 


(53)

6.1 Fourier transform of the kinetic equation, and the Fokker-Planck equation

Equation (53) can be written after Fourier transformation,


Plin(  L  r,t  +  St)  — 


In the above equation, we have used the relation

$$\sum_{j \ne 0} e^{-2\pi i\, jk/n} \csc^2(j\pi/n) = (n^2 - 6kn + 6k^2 - 1)/3. \tag{55}$$
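Relation (55) is easy to confirm numerically. The short sketch below assumes the sum runs over $j = 1, \ldots, n-1$ and that $0 \le k \le n$; the function names are hypothetical.

```python
import cmath
import math

def csc2_sum(n, k):
    """Left-hand side of relation (55): sum over j = 1..n-1."""
    return sum(cmath.exp(-2j * math.pi * j * k / n) / math.sin(j * math.pi / n) ** 2
               for j in range(1, n))

def closed_form(n, k):
    """Right-hand side of relation (55)."""
    return (n * n - 6 * k * n + 6 * k * k - 1) / 3

# Largest deviation between both sides over a range of n and k
worst = max(abs(csc2_sum(n, k) - closed_form(n, k))
            for n in range(2, 13) for k in range(0, n + 1))
```

Because the terms $j$ and $n - j$ are complex conjugates, the sum is real and the sign chosen in the exponent does not affect the result.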

A second-order Taylor expansion of Eq. (54) with respect to $\delta\tilde{L}_r[h]$ makes it clearer. Neglecting the third-order terms in $\delta\ell$, we find the remarkable Fokker-Planck equation (note that $\tilde{\ell}_{-k} = \tilde{\ell}_{n-k}$, $k$ being defined modulo $n$),


n— 1 

E 

h=0 


n-1 
6  n2 


n  —  1 

6£  +  E  e- 

fc=i 


^fhkk 

n 


(l-^)Ik)Pun(Lr-SLT[h},t). 


9P [in  (  L  r,  t) 

m 


61^  d 


vc  ^  u 

—  St  ^ 


6£2 


l--)ekPlin{Lr,t) 

n  J 


y.1  d2Plin(Lr,t)  se_ 

+  ^  or..  rPn  .  2 nSt 


2n26t  ^  dtudl-w 


d2Plin(  L  r ;  j) 


fc,fc'=l 


d£k'd£k-k' 


When $n = 2$, we recover (26) linearized with $\alpha_0 = \alpha_1 = 1$ and $U(\tilde{\ell}_1) \simeq \tilde{\ell}_1/2$ from Eq. (29).


6.2 Short discussion of the linearized kinetic equation

Equation (56) contains three terms:

i) The first term is a drift term, associated with the growth velocity $v$. At the end of the linear regime, we expect an exponential screening behavior similar to $U(\tilde{\ell}_1)$ in the two-needles case. The remarkable point lies in the factor $\frac{k}{n}\big(1 - \frac{k}{n}\big)\tilde{\ell}_k$. If we start from a situation where all the modes $k$ have the same weight $\tilde{\ell}_k = \tilde{\ell}$, then, due to the $\frac{k}{n}\big(1 - \frac{k}{n}\big)$ coefficient, the mode $k_{max} = n/2$ progressively dominates since it has the highest growth rate. Due to screening, most of the other modes disappear, and the system becomes equivalent to a comb of $n/2$ needles: a succession of period doublings is the result of the growth.

ii) The second term couples the modes $k$ and $n - k$.

iii) The third term is a mode-coupling term, which merges two modes $k'$ and $k''$ into a mode $k = k' + k''$.
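The mode selection described in item i) can be illustrated with a few lines of code. This sketch assumes only that the drift term gives each mode a growth rate proportional to $(k/n)(1 - k/n)$; the comb size `n = 16` and the names are hypothetical.

```python
def growth_factor(k, n):
    """Drift coefficient (k/n)(1 - k/n) multiplying mode k."""
    return (k / n) * (1 - k / n)

n = 16                                           # illustrative comb size (even)
rates = {k: growth_factor(k, n) for k in range(1, n)}
fastest = max(rates, key=rates.get)              # mode with the largest coefficient
```

For even $n$ the maximum is reached at $k = n/2$ with value $1/4$, and the coefficient is the same for the modes $k$ and $n - k$.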


6.3 Correlation between modes and fluctuations of a mode $q$ in the initial regime

We first calculate the correlation between modes,

$$\langle \tilde{\ell}_{q_1} \tilde{\ell}_{q_2} \rangle (t) = \int \tilde{\ell}_{q_1} \tilde{\ell}_{q_2}\, P(\tilde{L}_r, t)\, d\tilde{L}_r. \tag{57}$$

Integration by parts using Eq. (56) gives,


dt  \£qi£q2 


6£ 

(iL 

(i  _  £l 

)  +  Slt 

St 

v  n 

\  n . 

)  n  * 

_L  _ 

5l2 

bt2 

2  n2St 

°gi+92,o 

1 

2  nbt 

The fluctuation of a mode $q$ can also be determined from Eq. (56) and leads to,

» n, ;)  <*■>♦  w 

S£ 2  Si 2 

+rf8t8q'n/2  +  ^6t^~  6q'n/^  ’ 

which has a solution similar to that of the two-needles case, and which underlines the existence of a $q$-dependent effective velocity,

-*"<«> -fJM)  <“> 


/7l\  2DlS9,n/2  +  2D2  (1  -  Sq  n/2)  ,  ,  ,  s  ,s  „ 

W«n  =  - -  (6XP  M*)*)  ~  !]  (61) 

and shows that the mode $q = n/2$ has the fastest growth. We have noted

$$D_1 = \frac{\delta\ell^2}{2n^2\delta t} \quad \text{and} \quad D_2 = \frac{\delta\ell^2}{2n\delta t}. \tag{62}$$

The  correlation  between  the  modes  has  a  similar  behaviour, 

1 7  7  \  _  *^1^91+92,0  H“  (1  “  ^i+g2,o)  r  /  /  NA 

Vqi  Ain  ~  - ^(91,  <?2) -  [eXP  {Veff  (63) 

with  an  effective  velocity, 

'»<«•«■>-  B  (?  (-5). i+?  (*-?))  <«> 

Further studies are in progress to relate the linear regime to the screened regime and to obtain analytically the scaling laws suggested numerically by Krug et al.4 and Adda Bedia14.

ENTROPY DYNAMICS ASSOCIATED WITH SELF-ORGANIZATION

A general model linking dynamics of informational entropy and the self-organization process in an
open,  steady-state,  nonequilibrium  system  is  proposed.  Formulas  for  dynamics  of  the  informational 
entropy  flow  and  its  rate  are  developed  with  respect  to  random  process  of  influences  exerted  upon  a 
system.  It  was  revealed  that  the  open  system  responds  to  a  strong  change  of  conditions  by  steep 
growth  of  the  informational  entropy  flow  up  to  a  maximum  value  at  the  critical  point  associated  with 
the  self-organization  process.  An  example  of  self-organization  during  elasto-plastic  deformation  of 
metal  is  considered. 


1  Introduction 

In this paper, we consider an open, steady-state, nonequilibrium, active system to be a dissipative system. The spectrum of such systems is quite broad. If the exchange of energy and matter with the environment is structured, we can designate it as a transfer of information (entropy). In this broad sense, information exchange is not necessarily limited to "intelligent" systems [1]. The presence of an energy flux from an external source to a system and the dissipation of energy to the external environment are the preconditions of activity in any system. Because of this, the evolution of open active systems does not necessarily lead towards the equilibrium state. On the contrary, open systems may be involved in processes of self-organization, which result in more complicated and more advanced structures.

As is known [8,9], for open systems the variation of the entropy $dS$ over an interval of time $dt$ can be decomposed into a sum of two components, $d_eS$ and $d_iS$, with quite different physical meanings. $d_eS$ is the entropy flow, which depends on the processes of matter and energy exchange between the system and its environment. $d_iS$ is the entropy production, caused by irreversible processes inside the system. If the conditions $d_eS < 0$ and $|d_eS| > d_iS$ hold, certain stages of the temporal evolution of the open system can occur with an overall decrease of entropy, $dS < 0$.

It is necessary to note that such a situation is possible only far from equilibrium, as in an equilibrium state the term $d_iS$ always prevails. It means that the system is so far from its equilibrium that the linear laws no longer apply; nonlinear terms become important. Self-organization is a "supercritical" phenomenon. Nevertheless, far from equilibrium the system may still evolve to some steady state. In far-from-equilibrium conditions, various types of self-organization processes may occur.

According to the traditional interpretation of entropy as a measure of the disorder (uncertainty) of a system, if the disorder decreases at the expense of entropy return in the course of evolution, the system evolves into more complicated and more advanced structures [8].

We present a general model linking the dynamics of informational entropy and the self-organization process in an open system. We analyze the behavior of the local zone in different "modes of being" [3]. Formulas for the dynamics of the entropy flow and its rate are obtained with respect to the process of influences exerted upon a system. Finally, we describe a few examples in order to elucidate the results obtained.

2 Definitions and Mathematical Formulation

Let an open system be in a steady (stable), nonequilibrium state. Further, let us assume that in this state the entropy production is compensated for by a negative entropy flow,

$$d_iS = -d_eS, \tag{1}$$

i.e., the system gives back as much entropy as is produced inside it. Eq. (1)
establishes  a  condition  of  current  balance  between  the  entropy  flow  through  the  system 
and  its  production  inside  the  system.  Current  balance  is  understood  as  a  stationary  (not 
time-dependent)  nonequilibrium  state  of  an  open  system,  stable  in  relation  to  small 
deviations.  The  system  in  this  state  is  actually  in  dynamic  balance  with  its  environment. 
The  entropy  production  inside  the  system  in  a  certain  sense  characterizes  exhaustion  of 
the  system’s  lifespan.  Formulas,  describing  dynamics  of  the  entropy  flow  and  its  rate,  will 
be  deduced  below.  According  to  Eq.  (1),  these  terms  can  be  extended  to  the  internal 
entropy  production  inside  the  system  in  a  steady  (stable)  nonequilibrium  state. 

All real processes are irreversible and unbalanced to some degree. Local gradients of temperature, chemical potential and pressure can exist only in a nonequilibrium system. Let $X$ be an important local parameter determining the longevity, lifetime, load-carrying capacity or any other property of vital significance for an open system in a local zone (hereafter we call $X$ the determining parameter). Further, let us label a zone of increased gradient of the determining parameter as the local dissipative zone (LDZ). Such a local zone limits the lifetime of a system. As a rule, these zones correspond to the places of the most probable failures during its lifetime. In the course of intensive operation, the number of LDZs can grow. The growth dynamics of the number of LDZs is an important property of the behavior of a dissipative system; however, this question is not considered in this paper. Here we investigate the evolution of a system based on processes occurring in only one LDZ.

There are at least two auto-regulating mechanisms of energy dissipation which act inside a system. We consider the behavior of an LDZ by using the model of a bistable element. A bistable element (Fig. 1) has two stable states (down and up), in each of which it can exist for a rather long time. Let us denote the mean value of the determining parameter of a system in the down state as $X_0$ and in the up state as $X_1$. External influences or internal changes in a system can result in a transition of the bistable element from one state to another. It is assumed that the transition from the down state to the up state is caused by an external influence exerted upon the system at which the determining parameter $X$ (averaged over the volume of the LDZ) exceeds the mean value of the threshold level $X_{th}$.

In each particular case, the physical content of the determining parameter, its threshold level and the criterion of transition from one state to another are defined by the type and nature of the system examined and depend on the statement of the problem. In Sec. 5 we shall consider the interpretation of this concept for the elasto-plastic deformation of metal in the local zone of a load-carrying structure.

For studying the behavior of the LDZ of an open system, the mathematical apparatus of Markovian stochastic processes was used. The solution of the differential Kolmogorov equations for the probabilities $P_0(t)$ and $P_1(t)$ of, respectively, the down and up states of the bistable element was obtained [11]. The case of a homogeneous Markovian process was considered, that is, the transition rates $\mu(t)$ and $\nu(t)$, respectively from the up state to the down state and vice versa, do not depend on time: $\mu(t) = \mu$; $\nu(t) = \nu$. Hereafter we call $\alpha = \nu/\mu$ the regime parameter. Depending on the regime parameter $\alpha$, three typical "modes of being" [3] of the LDZ are considered: light at $\alpha < 1$, $\nu < \mu$, $P_0 > P_1$, i.e., the LDZ is in the down state for a long period of time; symmetric at $\alpha = 1$, $\nu = \mu$, $P_0 = P_1 = 0.5$; heavy at $\alpha > 1$, $\nu > \mu$, $P_1 > P_0$, i.e., the LDZ is in the up state for a long period of time.

Figure 1. Graph of transition in the bistable element modeling the LDZ behavior.
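The two-state Kolmogorov equations can be integrated directly as a sanity check. The sketch below is an illustration, not the solution of ref. [11]: it assumes the standard form $dP_0/dt = -\nu P_0 + \mu P_1$ with $P_0 + P_1 = 1$ and the down state as initial state, for which the closed form is $P_0(t) = (1 + \alpha e^{-\beta t})/(1 + \alpha)$ with $\beta = \nu + \mu$. Function names and rate values are hypothetical.

```python
import math

def p0_closed(t, nu, mu):
    """Closed-form probability of the down state, starting from the down state."""
    alpha, beta = nu / mu, nu + mu
    return (1.0 + alpha * math.exp(-beta * t)) / (1.0 + alpha)

def p0_numeric(t_end, nu, mu, steps=10000):
    """Fourth-order Runge-Kutta integration of dP0/dt = -nu*P0 + mu*(1 - P0)."""
    def rate(p):
        return -nu * p + mu * (1.0 - p)
    h = t_end / steps
    p = 1.0                     # the element starts in the down state
    for _ in range(steps):
        k1 = rate(p)
        k2 = rate(p + 0.5 * h * k1)
        k3 = rate(p + 0.5 * h * k2)
        k4 = rate(p + h * k3)
        p += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return p

nu, mu = 2.0, 0.5               # illustrative rates: heavy mode, alpha = 4
numeric = p0_numeric(1.3, nu, mu)
closed = p0_closed(1.3, nu, mu)
```

At long times the closed form tends to $1/(1 + \alpha)$, so the up state dominates in the heavy mode and the down state in the light mode, as described above.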


3  General  Results 


Let us consider the behavior of the statistical properties of the determining parameter $X$ for the examined LDZ in the course of evolution. The time dependence of the mean value $\bar{X}(t)$ of the determining parameter is given by the formula

$$\bar{X}(t) = \big[X_0 + \alpha X_1 - \alpha (X_1 - X_0)\, e^{-\beta t}\big] / (1 + \alpha), \tag{2}$$

where $\beta = \nu + \mu$. At the transient stage, the mean value tends to grow or decrease depending on the initial conditions, as shown in Figure 2 by the solid lines. Curve 1 occurs when the initial state is the down one and curve 2 when the initial state is the up one. The dashed line corresponds to the steady level of the mean value of the determining parameter. One can observe that both curves 1 and 2 approach the steady level at the stationary stage.

The second of the principal statistical properties we studied is the variance of the determining parameter $X$. The time course of the variance $D_x(t)$ is described by the expression

$$D_x(t) = (X_1 - X_0)^2 \big[P_0(t) - P_0^2(t)\big]. \tag{3}$$

The graph of the time courses of the variance $D_x(t)$ for the heavy and light modes of being of the system is shown in Figure 3. Since at the initial moment of time $P_0(t_0) = 1$, we have $D_x(t_0) = 0$. At the stationary stage (at $t \to \infty$) we have

$$D_{st} = (X_1 - X_0)^2\, \frac{\alpha}{(1 + \alpha)^2}. \tag{4}$$

For the heavy mode of being, at $\alpha > 1$, at a critical point corresponding to the moment

$$t_b = \frac{1}{\beta} \ln\frac{2\alpha}{\alpha - 1}, \tag{5}$$

the function (3) has a maximum. The light mode of being of the LDZ, when $\alpha < 1$ (see line 2 in Figure 3), is characterized by the absence of the critical point and by stabilization of the variance at the level $D_{st}$ in the course of evolution towards a stationary stable state.

Figure 2. The time courses of the mean value of the determining parameter (solid lines) from down (1) and up (2) initial states. The steady level is denoted by a dashed line.

Figure 3. The time courses of the variance $D_x(t)$ for heavy (1) and light (2) modes of being (solid lines). The steady level $D_{st}$ is denoted by a dashed line. The coordinates of the critical point are labeled $D_{max}$ and $t_b$.
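The mean, the variance and the critical time can be evaluated in a few lines. This sketch assumes the down state as initial state, so that $P_0(t) = (1 + \alpha e^{-\beta t})/(1 + \alpha)$, and takes the critical time as the moment where $P_0 = 1/2$, i.e. $t_b = \ln[2\alpha/(\alpha - 1)]/\beta$. The numerical values and function names are illustrative assumptions.

```python
import math

def p_down(t, nu, mu):
    """Probability of the down state, starting from the down state."""
    alpha, beta = nu / mu, nu + mu
    return (1.0 + alpha * math.exp(-beta * t)) / (1.0 + alpha)

def mean_x(t, nu, mu, x0, x1):
    """Mean of the determining parameter, as in Eq. (2)."""
    p0 = p_down(t, nu, mu)
    return x0 * p0 + x1 * (1.0 - p0)

def variance_x(t, nu, mu, x0, x1):
    """Variance of the determining parameter, Eq. (3)."""
    p0 = p_down(t, nu, mu)
    return (x1 - x0) ** 2 * (p0 - p0 * p0)

def critical_time(nu, mu):
    """Moment where P0 = 1/2; it exists only in the heavy mode (alpha > 1)."""
    alpha, beta = nu / mu, nu + mu
    if alpha <= 1.0:
        raise ValueError("no critical point for alpha <= 1")
    return math.log(2.0 * alpha / (alpha - 1.0)) / beta

nu, mu, x0, x1 = 2.0, 0.5, 1.0, 3.0       # illustrative values, alpha = 4
t_b = critical_time(nu, mu)
d_max = variance_x(t_b, nu, mu, x0, x1)   # equals (x1 - x0)^2 / 4
d_st = variance_x(1e3, nu, mu, x0, x1)    # steady level, alpha (x1 - x0)^2 / (1 + alpha)^2
```

With these inputs the variance peaks at $(X_1 - X_0)^2/4$ at $t_b$ and then relaxes to the lower steady level $D_{st}$, which is the behavior of curve 1 in Figure 3.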


Let us assume that a random process $Y(t)$ of dynamic influences, with probability distribution function (PDF) $F(Y)$, is exerted upon a system. This process produces a similar random process $X(t)$ of changes of the determining parameter $X$ in the LDZ. Further, let us assume that the PDF for instantaneous values of this random process is $F(X)$. The dependence of the regime parameter $\alpha$ on the PDF of this process is obtained in the form

$$F(X_{th}) = P_0 = 1/(1 + \alpha), \quad 1 - F(X_{th}) = P_1 = \alpha/(1 + \alpha), \tag{6}$$

where $F(X_{th})$ is the value of the PDF $F(X)$ at the threshold level $X_{th}$, and $P_0$ and $P_1$ are the final probabilities of the down and up states corresponding to the stationary stage. In Sec. 5 we shall obtain functions connecting the regime parameter $\alpha$ and the statistical properties (mean value and variance) of a normal (Gaussian) random process exerted upon a system.

The main point of interest of the foregoing analysis is the possibility of computing and forecasting the time course of entropy. Note that the various interpretations of entropy are internally linked quite closely [7,8]. The physical entropy of a system coincides with the thermodynamic entropy $S$. The informational entropy $H$ is connected to them by the relation [7]:

$$S = kH \ln 2, \tag{7}$$

where $k$ is the Boltzmann constant. We consider the informational entropy as the measure of uncertainty, equal to the amount of information (according to Shannon) required for removing this uncertainty. For the bistable element modelling the LDZ, the informational entropy $H$ is determined from the equation [7]:

$$H = -\sum_{j=0}^{1} P_j(t) \log_2 P_j(t). \tag{8}$$

The minimum value of entropy, $H = 0$, corresponds to the degeneration of a stochastic system into a rigid deterministic system. For open self-organizing systems the maximum $H_{max} = 1$ corresponds to a moment of bifurcation, when there is the destruction of a pattern (microstructure) that has exhausted its dissipative abilities, resulting in the emergence of a new pattern at other hierarchical levels. Taking into account the decomposition of entropy into a sum of two components and Eq. (1) for the current balance, we have considered the dynamics of the informational entropy flow in the LDZ of a system and obtained the analytical dependence of the informational entropy flow on time in the following form:

(9)


The time course of the entropy flow $H(t)$ is shown in Figure 4. Analysis of Eq. (9) shows that in the heavy mode of being of the LDZ (at $\alpha > 1$), at the time $t_b$ determined by Eq. (5), the maximum entropy flow is reached, $H(t_b) = 1$. It can be shown that this moment corresponds to the condition of equal probabilities (maximum uncertainty) of finding the LDZ in the down and up states, $P_0(t_b) = P_1(t_b) = 0.5$. Note that this moment $t_b$ corresponds to the critical point of maximum variance $D_{x\,max}$ of the parameter $X$. That means that some fluctuations get amplified up to a macroscopic scale. In this way, fluctuations (environmental perturbations or eigenfluctuations) may drive the system into a completely new state and thus become the driving force of system development ("order through fluctuation" [9]). Instabilities at the same time can break symmetry, that is, bifurcations occur: the system may choose among two states, though determined by causality. The selection of the future path of development is now unpredictable [5]. However, at least we can predict the moment of maximum uncertainty and endeavor to take precautions against unfavorable paths of development.

After the system passes through the critical point, which is a stochastic analogue of the bifurcation point [8,12], the entropy flow decreases and, leaving the transient stage, stabilizes at the steady level $H_{st}$:

$$H_{st} = \ln\!\big[(1 + \alpha) / \alpha^{\alpha/(1+\alpha)}\big] / \ln 2. \tag{10}$$

According to the traditional interpretation of entropy, this means that, because of the entropy outflow in the course of evolution, the disorder decreases. A system is structured in response to the heavy mode of being by self-organization of more complicated and more advanced patterns. This is an attribute of the system's adaptation to the random process characterized by the regime parameter $\alpha > 1$.

The light mode of being of the LDZ, when $\alpha < 1$ (see line 2 in Figure 4), is characterized by the absence of the stochastic analog of the bifurcation point and by stabilization of the entropy flow at the level $H_{st}$ during the period of exit from the transient to the stationary stage.
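The entropy time course of the bistable element can be sketched by inserting the two-state probabilities into Eq. (8). The code below is an illustration under assumptions: the element starts in the down state, so that $P_0(t) = (1 + \alpha e^{-\beta t})/(1 + \alpha)$, and the steady level is taken as $H_{st} = \ln[(1 + \alpha)/\alpha^{\alpha/(1+\alpha)}]/\ln 2$. Names and rate values are hypothetical.

```python
import math

def p_down(t, nu, mu):
    """Probability of the down state, starting from the down state."""
    alpha, beta = nu / mu, nu + mu
    return (1.0 + alpha * math.exp(-beta * t)) / (1.0 + alpha)

def entropy(t, nu, mu):
    """Informational entropy of Eq. (8) in bits; 0*log(0) is taken as 0."""
    p0 = p_down(t, nu, mu)
    h = 0.0
    for p in (p0, 1.0 - p0):
        if p > 0.0:
            h -= p * math.log2(p)
    return h

def entropy_steady(alpha):
    """Steady level of the entropy, Eq. (10)."""
    return math.log((1.0 + alpha) / alpha ** (alpha / (1.0 + alpha))) / math.log(2.0)

nu, mu = 2.0, 0.5                                    # heavy mode, alpha = 4
alpha, beta = nu / mu, nu + mu
t_b = math.log(2.0 * alpha / (alpha - 1.0)) / beta   # moment where P0 = 1/2
h_peak = entropy(t_b, nu, mu)
h_late = entropy(1e3, nu, mu)
h_light_max = max(entropy(0.05 * i, 0.5, 2.0) for i in range(400))   # light mode, alpha = 1/4
```

In the heavy mode the entropy rises from zero to one bit at $t_b$ and then falls to $H_{st}$; in the light mode it never reaches one bit and simply saturates at its steady level.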

Special  interest  is  attracted  by  dynamics  of  the  entropy  flow  rate  in  the  course  of  the 
system’s  evolution.  The  rate  is  defined  as  the  derivation  of  the  entropy  flow  function  as: 

dH  _  ve'*  ,[<*(1-^)1  (11) 

dt  In  2  l\  +  ae-pt  _ 

A plot of the entropy flow rate versus time is shown in Figure 5. Analysis of Eq. (11)
shows that when a system responds to the heavy being mode (α > 1) by a rapid increase of
the entropy flow, the entropy flow rate simultaneously falls, reaching zero at the time tb.
Thereafter the rate of entropy flow becomes negative, passes through a minimum, and tends
to zero as the transient process approaches the steady (stable) stage.
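The behavior described above can be checked numerically. The sketch below is not taken from the paper: it assumes, in line with the form of Eq. (11), that the probability of the up state grows as P1(t) = α/(1+α)·(1 - exp(-βt)) from P1(0) = 0, that the informational entropy is the two-state entropy in bits, and that ν = αβ/(1+α). All function names are hypothetical.

```python
import math

def p_up(t, alpha, beta):
    """Assumed probability of the 'up' state at time t, starting from P1(0) = 0."""
    return alpha / (1.0 + alpha) * (1.0 - math.exp(-beta * t))

def entropy(p):
    """Informational entropy (bits) of a two-state system with up-probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))

def entropy_flow(t, alpha, beta):
    """Time course H(t) of the entropy under the assumed two-state model."""
    return entropy(p_up(t, alpha, beta))

def entropy_rate(t, alpha, beta):
    """dH/dt in the form of Eq. (11); valid for t > 0."""
    e = math.exp(-beta * t)
    nu = alpha * beta / (1.0 + alpha)  # assumption: nu = alpha * beta / (1 + alpha)
    ratio = alpha * (1.0 - e) / (1.0 + alpha * e)
    return -nu * e / math.log(2.0) * math.log(ratio)

def steady_level(alpha):
    """Steady level H_st in the form of Eq. (10)."""
    return math.log((1.0 + alpha) / alpha ** (alpha / (1.0 + alpha))) / math.log(2.0)

def critical_time(alpha, beta):
    """Time t_b at which P1 = 1/2; it exists only in the heavy mode (alpha > 1)."""
    if alpha <= 1.0:
        return None
    return math.log(2.0 * alpha / (alpha - 1.0)) / beta

if __name__ == "__main__":
    alpha, beta = 5.0, 1.0  # illustrative values only
    tb = critical_time(alpha, beta)
    print("t_b =", tb)
    print("H(t_b) =", entropy_flow(tb, alpha, beta))
    print("dH/dt at t_b =", entropy_rate(tb, alpha, beta))
    print("H_st =", steady_level(alpha))
```

Under these assumptions the entropy reaches its maximum of one bit exactly where the rate changes sign, and no such point exists for α < 1, which matches the qualitative picture given for the heavy and light modes.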


4  Influence  of  the  Variation  of  Conditions 

Consider now the behavior of the LDZ under a variation of the external and/or internal
conditions of a system. The main point of interest of the foregoing analysis is the
possibility of computing and forecasting the system response (namely the time course of the
informational entropy) to a change of the being conditions. We denote the values of the
quantities corresponding to the new set of conditions by letters with an asterisk. The
change of conditions may modify both the regime parameter α* and the probabilities P0*
and P1*. The time t* is counted anew from the moment the conditions change. The solution
of the Kolmogorov differential equations for the probabilities P0*(t*) and P1*(t*) was
obtained in [11].

An important role is played by the analysis of the entropy flow H*(t*) and its rate
dH*/dt* as the response of an open system to a sudden change of the external and/or
internal conditions. Using Eq. (8), a mathematical expression for the time course of the
informational entropy flow H*(t*) under the new conditions was obtained in the following form:

The informational entropy flow rate after a variation of the conditions of the system's
existence can be expressed as:

$$\frac{dH^*}{dt^*} = -\frac{(\nu^*\beta-\nu\beta^*)\,e^{-\beta^* t^*}}{\beta\ln 2}\,\ln\left[\frac{\nu^*\beta-(\nu^*\beta-\nu\beta^*)\,e^{-\beta^* t^*}}{\mu^*\beta+(\nu^*\beta-\nu\beta^*)\,e^{-\beta^* t^*}}\right] \qquad (13)$$


The initial rate of the entropy flow is equal to

It is evident that Eqs. (12) to (14) depend on the combination of the previous and new
being conditions.
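The dependence on both the previous and the new conditions can be illustrated with a short numerical sketch. It is not the paper's derivation: it assumes a two-state model in which the up-state probability relaxes exponentially from the old stationary value α/(1+α) to the new one α*/(1+α*) at the rate β*, and it takes the rate of the entropy flow by a finite difference rather than from a closed-form expression. Function and parameter names are hypothetical.

```python
import math

def entropy_bits(p):
    """Informational entropy (bits) of a two-state system with up-probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))

def p_up_after_change(t_star, alpha_old, alpha_new, beta_new):
    """Assumed relaxation of the up-state probability after the conditions change."""
    p_old = alpha_old / (1.0 + alpha_old)   # stationary value before the change
    p_new = alpha_new / (1.0 + alpha_new)   # stationary value after the change
    return p_new + (p_old - p_new) * math.exp(-beta_new * t_star)

def entropy_flow_after_change(t_star, alpha_old, alpha_new, beta_new):
    """H*(t*) under the assumed relaxation law."""
    return entropy_bits(p_up_after_change(t_star, alpha_old, alpha_new, beta_new))

def entropy_rate_after_change(t_star, alpha_old, alpha_new, beta_new, h=1e-6):
    """Central-difference estimate of dH*/dt*."""
    up = entropy_flow_after_change(t_star + h, alpha_old, alpha_new, beta_new)
    down = entropy_flow_after_change(t_star - h, alpha_old, alpha_new, beta_new)
    return (up - down) / (2.0 * h)

if __name__ == "__main__":
    # Illustrative values: light mode before the change, heavy mode after it.
    for t in (0.0, 0.3, 0.6, 1.0, 3.0):
        print(t,
              entropy_flow_after_change(t, 0.25, 5.0, 1.0),
              entropy_rate_after_change(t, 0.25, 5.0, 1.0))
```

In this sketch a change from a light to a heavy mode produces a peak of the entropy flow followed by a slow decay, whereas a change between two light modes gives a monotonic approach to the new steady level.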

Figure 6 shows the graphs of the informational entropy flow H* and its rate dH*/dt*
for the case when the being conditions become heavier (α < 1, α* > 1). In the first time
interval (0 < t < 1.4 sec), when α < 1, the LDZ exists in an easy being mode, so the entropy
flow is stabilized at the level Hst corresponding to this mode. At the time t = 1.4 sec, the
being conditions become heavier (α* > 1). An open system responds to a strong change of
conditions by a rapid increase of the entropy flow from the stationary level Hst, obtained
under the previous being conditions, to the maximum value H*(tb*) = 1. At the time tb* the
rate of entropy flow falls sharply to zero. The mathematical expression for the time tb*
was obtained in the following form:


$$t_b^* = -\frac{1}{\beta^*}\,\ln\frac{(\alpha^*-1)(\alpha+1)}{2\,(\alpha^*-\alpha)} \qquad (15)$$


Note that it depends on a combination of the previous and new being conditions.
At the time tb* after the transition to a heavier being mode (α* > 1), the LDZ passes
through a bifurcation point. This point is associated with the destruction of the pattern
of the first hierarchical level, which has exhausted its dissipative possibilities, and the
emergence of a new appropriate pattern corresponding to the changed being conditions.
Having reached the second level of the hierarchy after the jump of development, the LDZ
enters an evolutionary stage of development. During this stage the entropy flow stabilizes
rather slowly as the system becomes saturated with information up to a level
I = 1 - Hst*, which corresponds to the new mode of system existence. In other words, the
LDZ adapts to the new being conditions by perfecting its structure. In this case, the
entropy flow rate (see Fig. 6) takes a negative value, passes through a minimum and,
remaining negative, tends to zero as the transient process reaches the stationary stage
with a new steady level of the entropy flow Hst*:


$$H_{st}^* = \frac{1}{\ln 2}\,\ln\frac{1+\alpha^*}{(\alpha^*)^{\alpha^*/(1+\alpha^*)}} \qquad (16)$$

which is adequate to the new conditions.

On passage through the critical point at time tb*, the variance D*x of the system's
determining parameter X is at its maximum. At this stage the system is characterized by the
highest degree of disorder, with random fluctuations manifested on the macroscopic level.
After passage through the critical point, the variance D*x stabilizes at a new stationary
level corresponding to the changed being conditions.

Figure 6. Response of a system to a change of the being conditions. The time courses of the
informational entropy flow H(t) (a) and its rate dH/dt (b) for the case when the mode of
being becomes heavier (α < 1, α* > 1). The coordinates of the critical point are labeled
H*max and tb*.


Space does not permit a more detailed examination of the time behavior of the
statistical properties of the system's determining parameter X after a change of the being
conditions. We note only that both a monotonic increase or decrease of the variance D*x
and the emergence of a characteristic peak on the variance curve are possible. The
characteristic peak is similar to that shown in Figure 3 and corresponds to a critical
point. Passage through such a peak and stabilization of the variance D*x at a new steady
level D*st confirm completion of the running-in (“burn-in”) period of a system and its
adaptation to the changed conditions. When testing the system, it is possible to measure
and record the time history of the mean value and variance of the determining parameter in
the vicinity of some LDZs. We believe that, on the basis of an analysis of the processes
occurring in the LDZ, it should be possible to speed up the acquisition of data concerning
the completion of the transition to a new state as well as the adaptation of the system to
the changed conditions. In Sec. 6 we shall consider plots representing the time evolution
of the mean value and the variance for a nucleation process in a model system involving
multiple steady states [8,12].

An open system of any nature returns to a steady stable state owing to the inflow of
information from the outside and/or the redistribution of the informational entropy between
the hierarchical levels of the system [2].

After passage through the bifurcation point, the entropy flow decreases as information
accumulates; this means a corresponding increase in the level of organization during the
system's development. At each hierarchical level of a system’s evolution, at the end of the
self-organization process, when the “architecture” of the system has been basically defined
and becomes saturated with information, the entropy curve gradually straightens, displaying
the transition of the system to the evolutionary stage of development. The growth in the
degree of organization of any system has a limit, a region of saturation, determined by the
limited capacity for information accumulation in the given structure at a given
hierarchical level.

Such a picture of the temporal evolution of an open system under variation of the
being conditions agrees with the synergetic approach to processes of self-organization
in nonequilibrium dissipative systems [1,4,8,9]. The analytical dependencies obtained
reveal quantitative relations reflecting the dynamics of the entropy flow and its rate,
which show the evolution of an open system in the course of its life cycle under variation
of the being conditions. The derived laws, in our opinion, are applicable to open
nonequilibrium systems of various natures: technical, economic, biological, ecological,
social, etc. The model discussed and the dependencies obtained make it possible to describe
on a unified basis the whole life cycle of a system, including passage through a sequence
of bifurcation points (“jumps” of development) and evolutionary stages of development at
each hierarchical level. The transition to a new level of development goes from disorder to
order through the phenomenon of instability at the bifurcation points, where a system has
a number of options to diverge in several directions.

In the following, we give a few examples to illustrate the results obtained.


5  Example  1.  Self-Organization  during  Elasto-Plastic  Deformation  of  Metal 

Here we consider an example of a self-organization process during elasto-plastic
deformation of metal in local zones of load-carrying structures. From the thermodynamic and
synergetic points of view, a material undergoing plastic deformation is an open system
brought far from equilibrium conditions [4,6,10]. In this case, the LDZ is represented by
the zone of stress (strain) concentration. As the determining parameter X, which limits
the lifetime of the load-carrying structure, let us take the stress intensity σi determined
from the distortion energy (Huber-von Mises-Hencky) theory. It was assumed that the
transition from the down state, corresponding to the elastic behavior of the metal in the
LDZ, to the up state (the plastic yielding state) is caused by an external influence exerted
upon the system under which the determining parameter exceeds a threshold level equal to
the mean value of the yield strength σs, that is: σi > σs.

The main point of this example is to show how the random loading process exerted
upon the load-carrying structure is linked with the structural response (namely the time
course of entropy) to a change of the loading conditions. Let us accept the hypothesis that
the instantaneous values of the loading process have a normal distribution with the
following statistical properties: mean value σm and variance Dσ of the stresses. Using
Eq. (6), the relations that link the regime parameter α with these statistical properties
and also with the threshold level (mean value of the yield stress σs) were obtained in the
following form:

α =

The dependence of α on σm and Dσ for low-carbon steel was computed from Eqs. (17),
(18) and plotted in Figure 7. We used this approach for a load-carrying welded structure,
namely the pivoting section of a gondola car body. Examination of the time history of
stress in elements of the gondola car, including the welded joints themselves, carried out
using an automated system for amplitude-spectral analysis, shows that the PDF of the
instantaneous values of the random process is normal.

In the course of elasto-plastic deformation in metals, a number of dislocation patterns
(microstructures) are formed. Ball, cellular, persistent slip band, and quasi-amorphous
microstructures arise and are destroyed one after another, successively replacing each
other against the background of the existing grain boundaries. A further increase of load
results in the formation of crack origins in the quasi-amorphous zone and growth of their
density. The spontaneous emergence of a quasi-amorphous microstructure corresponds to the
achievement of maximum disorder in this local zone, at which point the thermodynamic
entropy is maximum and equal to the enthalpy of melting. All these transitions are governed
by the achievement of a maximum level of the entropy flow [4]. Under the action of an
energy flux pumped into the LDZ by the stochastic loading process, the deformation ability
of the metal at the lowest hierarchical structural level is exhausted. Passing through a
critical point of bifurcation, the metal moves to a higher level of the pattern hierarchy,
to a microstructure with a higher dissipative capacity [4]. This transition between
patterns on the microscopic level is reflected by the curves of the entropy flow and its
rate obtained above. Returning to Figure 6, note that in the first time stage
(0 < t < 1.4 sec) the regime parameter α was determined by using Eq. (17) to be equal to
0.25. It corresponds to the vertical (gross) dynamic load of the gondola car. The value
α* = 5 corresponds to the total effect of the vertical and longitudinal inertial (in
collision of cars) loads. That mode of the LDZ being matches the transition of the metal
to the plastic state. In Figure 6, one can see that a load-carrying structure responds to
the heavy conditions by a rapid increase of the entropy flow from the stationary level Hst,
obtained under vertical load, to the maximum H*max at the time tb*. This critical point is
associated with the destruction of the pattern that has exhausted its dissipative abilities
and the emergence of a new microstructure at another hierarchical level corresponding to
the plastic state of the LDZ. The local zone becomes structured, responding to the heavy
being mode by self-organization of more advanced patterns.

Figure 7. Change of the regime parameter α as a function of the statistical properties
(mean value σm (a) and variance Dσ (b)) of the Gaussian random process of dynamic stresses:
1 - 5) Dσ is equal to 50, 100, 150, 200, 300 MPa² respectively; 6 - 12) σm is equal to 100,
200, 300, 350, 400, 450, 500 MPa respectively. The line α = 1, corresponding to the
threshold level σs = 310 MPa, is shown as a dashed line.
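For the gondola car values quoted above (α = 0.25 under vertical load, α* = 5 in collision), the steady entropy levels and the position of the critical point can be evaluated directly. The sketch below assumes that the steady level is the two-state entropy with up-state probability α/(1+α), and that the critical time follows from an exponential relaxation between the two stationary probabilities. Because β* is not quoted in the text, only the dimensionless product β*·tb* is computed. Function names are hypothetical.

```python
import math

def steady_level(alpha):
    """Steady entropy level (bits) for regime parameter alpha, two-state assumption."""
    return math.log((1.0 + alpha) / alpha ** (alpha / (1.0 + alpha))) / math.log(2.0)

def critical_time_scaled(alpha_old, alpha_new):
    """Dimensionless critical time beta* x t_b*, for alpha_old < 1 < alpha_new."""
    ratio = (alpha_new - 1.0) * (alpha_old + 1.0) / (2.0 * (alpha_new - alpha_old))
    return -math.log(ratio)

h_before = steady_level(0.25)              # steady level under vertical load
h_after = steady_level(5.0)                # steady level after the collision load
info_after = 1.0 - h_after                 # information level I = 1 - Hst*
tau_b = critical_time_scaled(0.25, 5.0)    # beta* x t_b*

print(round(h_before, 4), round(h_after, 4), round(info_after, 4), round(tau_b, 4))
```

Under these assumptions the entropy flow rises from about 0.72 to its maximum of 1, then relaxes to about 0.65 (an information level of about 0.35), and the critical point is reached at β*·tb* = ln 1.9, roughly 0.64.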


6  Example  2.  Nicolis  and  Prigogine's  Model 

A process of nucleation caused by the formation of germs in a model system involving
multiple steady states is considered by Nicolis and Prigogine [8,12]. The plots representing
the time evolution of the average value and the variance for this numerical model are
shown in Figure 8, which was adapted from [12].

Figure 8. Evolution of the mean value X (a) and variance Dx (b) of fluctuations for the
Nicolis-Prigogine model. Adapted from [12].


We can see the transition of the model system from the initial state to another stable
state corresponding to a higher mean value. During the process the mean value of the
determining parameter increases monotonically (see Fig. 8a). The variance begins to
increase sharply as soon as a sufficiently large new domain appears in the system through
fluctuations. The appearance of the typical peak on the curve Dx(t) and the subsequent
stabilization of the variance at a new steady level (see Fig. 8b) lead to the conclusion
that the transition period is finished.

The plots of Figure 8 correspond to the analytical dependencies, Eqs. (2), (3), of the
mean value X(t) and variance Dx(t) of the determining parameter X on time (see Figures 2
and 3). We believe that, on the basis of an analysis of the processes occurring in the LDZ,
it should be possible to speed up the acquisition of data concerning the completion of the
transition as well as the adaptation of the system to new conditions.


7  Conclusions 

The various kinds of open systems (technical, ecological, biological, socio-economic,
etc.) respond to a strong change of being conditions in the same way: by a steep growth of
the entropy flow up to a maximum value at the critical point. According to the traditional
interpretation of entropy, this means that disorder and chaos in the system increase at
this stage of its evolution. The critical point (the stochastic analogue of a bifurcation
point) is associated with the self-organization process, that is, the destruction of the
pattern of the previous hierarchical level, which has exhausted its possibilities, and the
resulting emergence of new, more complicated and more advanced patterns corresponding to
the changed being conditions. An open system of any nature returns to a steady stable
state owing to the inflow of information from the outside and/or the redistribution of the
informational entropy between the hierarchical levels of the system.

The mathematical expressions obtained for the time course of the informational entropy
flow make it possible to predict the moment at which the critical point will be reached.
That is the moment of maximum uncertainty, instability, and chaos in the system, when small
fluctuations become amplified up to a macroscopic scale. This may drive the open system
into a completely new state and thus become the driving force of the system’s
development.


Scaling properties in financial fluctuations are reviewed from the standpoint of statistical
physics. We first show theoretically that the balance of demand and supply enhances
fluctuations due to the underlying phase transition mechanism. By analyzing tick data of
yen-dollar exchange rates we confirm two fractal properties: (1) the distribution of rate
changes over a fixed number of ticks is approximated by a symmetric stretched exponential
function for a wide range of time intervals; (2) the distribution of time intervals between
trades nearly follows a power law. Empirical fractal properties in companies' financial
data, such as distributions and fluctuations in assets and incomes, are discussed with a
simple model. The importance of methods and theories for phase transitions is discussed.


1  Introduction 

The concept of fractals has spread over all fields of science and technology;
however, it seems less well known that this concept was born in the field of economics when
Mandelbrot was investigating price changes in an open market around 1963 [1]. He
found empirically that a chart of cotton market price changes looks similar to
another chart with a different time resolution. A general concept flashed across his mind
that such scale invariance could be the clue to characterizing many complex phenomena
around us. After about 20 years the concept he created, fractals, became a key word in
a wide range of research activity. Now that more than 35 years have passed since his
original discovery, economics has become one of the hottest topics in the study of fractals.

A new field of science, econophysics, was established recently, in 1997 [2, 3]. This is
the study of economic phenomena based on the methods and approaches of physics. Among
the many topics in econophysics there are three that are closely related to the study of
fractals: price changes in open markets, the distribution of company incomes, and the
scaling relation of fluctuations in company size. In this paper we briefly review
these topics and discuss the validity of the known empirical laws, introducing simple
mathematical models for better understanding.

In the following section we discuss price fluctuations, which naturally involve critical
fluctuations near the equilibrium point of demand and supply. In Sec. 3 we show results
of tick data analysis of foreign exchange rates. We show theoretically that the fat tails of
the rate change distribution can be modeled by a very simple physical equation, a
Langevin equation with a random coefficient. In Sec. 4 we review the empirical laws on
companies' macroscopic financial data of annual income and assets. We introduce a very
simple model of company growth that reproduces Zipf's law in the income distribution,
and we also discuss a nonlinear relation between income and assets. Sec. 5 is devoted to
general discussion and concluding remarks on the importance of statistical physics for
data analysis in economics.


2  Demand  and  supply;  a  phase  transition  view 

In economics, it had been believed for a long time that demand and supply balance
automatically. However, it has become evident that in reality such balancing is hardly
realized for most popular commodities in our daily life [4, 5]. The important point is
that demand is essentially a stochastic variable because human action can never be
predicted perfectly; hence the balance of demand and supply should also be viewed in a
probabilistic way. If demand and supply were balanced on average, the probability of
finding an arbitrarily chosen commodity on the shelves of a store would be 1/2; namely,
about half of the shelves would be empty. Contrary to this theoretical estimate, shelves
in any department store or supermarket are nearly always full of commodities. This
clearly demonstrates that supply is much in excess in such stores. Excess supply generally
holds for most commodities, and especially for food in economically advanced
countries. For example, it has been reported in a newspaper that about a quarter of the
food produced in the USA is disposed of as garbage. If food supply were equal to demand on
average, it would imply that about half of the population was starving!

In general, the stochastic properties of demand and supply can be well characterized
by a phase transition view, which consists of two phases: the excess-demand and
excess-supply phases. It has been mathematically proven that for commodities whose
production costs are smaller than a certain portion of the selling prices, an excess-supply
strategy becomes the best strategy when the demand fluctuates randomly and unsold
commodities are to be disposed of [6]. Obviously, department stores follow this
excess-supply strategy. The other strategy, keeping excess demand, can also be a rational
solution when the production cost and the selling price are close. The critical point at
which the averaged demand and supply are equal is realized only in a very special case.
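The threshold behavior described above can be illustrated with the classical newsvendor model, used here as a stand-in; it is not necessarily the formulation proved in [6]. The sketch assumes that each unit costs `cost` to produce and sells at `price`, that unsold units are discarded without value, and that demand is normally distributed. Under these assumptions the expected profit is maximized when the supply equals the demand quantile at level (price - cost)/price, so the "certain portion" of the selling price is one half. All names and numbers are hypothetical.

```python
from statistics import NormalDist

def optimal_supply(cost, price, mean_demand, sd_demand):
    """Supply level maximizing expected profit when unsold units are discarded."""
    if not 0.0 < cost < price:
        raise ValueError("expected 0 < cost < price")
    critical_ratio = (price - cost) / price   # probability that demand is fully met
    return NormalDist(mean_demand, sd_demand).inv_cdf(critical_ratio)

def phase(cost, price):
    """Phase of the optimal strategy in this simplified model."""
    ratio = cost / price
    if ratio < 0.5:
        return "excess supply"
    if ratio > 0.5:
        return "excess demand"
    return "critical point"

if __name__ == "__main__":
    # Illustrative numbers: mean demand 1000 units, standard deviation 100 units.
    for cost in (20.0, 50.0, 80.0):
        print(cost, phase(cost, 100.0), optimal_supply(cost, 100.0, 1000.0, 100.0))
```

In this simplified picture the optimal supply exceeds the mean demand for cheap-to-produce goods, falls below it when cost and price are close, and equals the mean demand only at the single ratio cost/price = 1/2, which plays the role of the critical point.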

It is a general property of a phase transition system that fluctuations are largest at the
phase transition point, and this property also holds in this demand-supply system. In
markets for ordinary commodities, consumers and providers are independent and
the averaged supply and demand are generally not equal. The resulting price fluctuations
are generally slow and small in such markets because the system is away from the critical
point. By contrast, an open market for stocks or foreign exchange is governed by
speculative dealers who frequently change their positions between buyer and seller. It is
shown that such speculative actions make demand and supply balance automatically on
average by changing the market price [5]. As the system is always at the critical point, the
resulting price fluctuations are generally quick and large, and involve fractal properties.


3  Price  changes  in  open  markets 

The physical mechanism of market price fluctuations was first studied by one of the
authors (H. T.) et al. in 1992 by introducing a numerical market model [7]. The model
consists of speculative dealers who transact with one another simply following the basic
rule "buy at a lower price and sell at a higher price". These two threshold prices are
determined at each time step by each dealer, taking into account the information on past
market price changes. It is shown that even the minimal case of 3 dealers can show
chaotic behavior, implying that the nonlinear effect of transactions is very strong. The
resulting market prices naturally include sudden falls or rises when the parameter
controlling the dealers' averaged response to the latest price change is large enough.

In 1995 a famous paper by Mantegna and Stanley appeared in Nature, in which price
changes in a stock market are reported to follow a power law distribution with truncation
at higher values [8]. The exponent of the power law is estimated as 1.4 in terms of the
characteristic exponent of the stable distribution. However, this power law has not been
widely accepted, and the precise functional form of the distribution of open market price
changes is still under intensive debate.

Here, we show our original results from an analysis of tick data of yen-dollar exchange
rates over about six months. Tick data are the highest-resolution transaction data of
foreign exchange markets, and are announced to the dealers electronically. Foreign exchange
transactions are done by telephone, so the market is active whenever more than 2
dealers are working anywhere on the globe; namely, continuous tick data can be obtained
except on weekends. The tick data monitor several representative dealers' transactions, and
the number of data points is about 12,000 per day. We analyzed a half year of data, from
October 1998 to March 1999, and the total number of ticks exceeds a million.

Fig. 1 shows a typical example of yen-dollar rate changes on 3 different time scales.
Intuitively, this figure demonstrates a fractal property of exchange rates along the time
axis measured in ticks; namely, Mandelbrot's classical finding also holds for this
contemporary market price fluctuation. The statistics of this fluctuation are very close to
those of a random walk; indeed, it is easy to confirm that the power spectrum of this
fluctuation clearly follows an inverse square law, which is almost identical to that of
Brownian motion. The corresponding auto-correlation function for the rate fluctuation per
tick, defined by the following equation, decays very quickly, as shown in Fig. 2.

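$$C(T) = \frac{\langle \Delta r(T_0+T)\,\Delta r(T_0)\rangle - \langle \Delta r(T_0)\rangle^2}{\langle \Delta r(T_0)^2\rangle - \langle \Delta r(T_0)\rangle^2} \qquad (1)$$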

where Δr(T) denotes the rate change at the T-th tick, namely, Δr(T) = r(T) - r(T-1) with
r(T) being the exchange rate at the T-th tick time, and <...> denotes the average over tick
times T0. The correlation is virtually 0 even at T = 2, which corresponds to an averaged
physical time of only 15 seconds. As seen from this property, the rate change fluctuation
is very close to white noise.
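The normalized autocorrelation defined above is straightforward to compute from a recorded rate series. The following sketch is only an illustration: the tick data themselves are not available here, so it uses a synthetic random walk with Gaussian increments as a stand-in, and the function names are hypothetical.

```python
import random

def autocorrelation(changes, lag):
    """Normalized autocorrelation of tick-to-tick changes at the given lag.

    All averages are taken over the same set of starting ticks T0,
    so that the value at lag 0 is exactly 1.
    """
    n = len(changes) - lag
    if lag < 0 or n < 2:
        raise ValueError("series too short for this lag")
    head = changes[:n]
    mean = sum(head) / n
    mean_sq = sum(x * x for x in head) / n
    cross = sum(changes[i + lag] * changes[i] for i in range(n)) / n
    return (cross - mean * mean) / (mean_sq - mean * mean)

def synthetic_rates(n, seed=1):
    """Synthetic exchange-rate path: a random walk with Gaussian steps (not real data)."""
    rng = random.Random(seed)
    rate, path = 120.0, []
    for _ in range(n):
        path.append(rate)
        rate += rng.gauss(0.0, 0.05)
    return path

if __name__ == "__main__":
    rates = synthetic_rates(20001)
    changes = [b - a for a, b in zip(rates, rates[1:])]   # rate change per tick
    for lag in (0, 1, 2, 5, 10):
        print(lag, autocorrelation(changes, lag))
```

For uncorrelated increments the values at every nonzero lag scatter around zero with a spread of order one over the square root of the sample size, which is the white-noise behavior referred to in the text.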

Figure 2. Autocorrelation of rate changes observed in tick data.

The distribution of rate change per tick is shown in Fig. 3 on a semi-log plot, with a
best-fitted normal distribution for comparison. The tails are much fatter than those of the
normal distribution, and we can find extreme events deviating from the mean value,
<Δr> = 0.0, with magnitude nearly 50 times larger than the standard deviation. In order to
clarify the functional form of this distribution we plot a cumulative distribution on a
log-log scale in Fig. 4. The distribution for Δr in the negative part is plotted by taking
absolute values. The distribution functions for both tails are nearly identical except for
the very large events, for which the number of data is limited. The tails decay gradually
on the log-log scale and we cannot find a linear slope. We can approximate the whole shape
of the distribution by a symmetric stretched exponential function as

$$P(\geq|\Delta r|) = 0.28\exp\left[-7.0\,(|\Delta r| - 0.04)^{0.4}\right], \quad |\Delta r| \geq 0.05 \qquad (2)$$

The dashed curve shows this function, plotted with a shift for better comparison.
For small absolute values the function does not fit this analytic form because real
data take discrete values, given by integer multiples of 0.01. Omitting about 10 points
around the center, the theoretical fit is quite good over a wide range. Note that this
function decays gradually in the log-log plot, so if we dared to fit a linear slope we
might deduce a non-universal power law whose exponent depends on the fitting range.
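The dependence of an apparent power-law exponent on the fitting range can be made concrete by evaluating the local log-log slope of the fitted stretched exponential. The sketch below assumes the fitted form 0.28·exp[-7.0(|Δr| - 0.04)^0.4] for |Δr| ≥ 0.05; the function names and the sample points are illustrative only.

```python
import math

def cumulative_tail(x):
    """Fitted cumulative distribution of |rate change| >= x, valid for x >= 0.05."""
    if x < 0.05:
        raise ValueError("the fitted form is only used for |dr| >= 0.05")
    return 0.28 * math.exp(-7.0 * (x - 0.04) ** 0.4)

def local_loglog_slope(x):
    """d ln P / d ln x: the exponent a power-law fit would report near x."""
    return -7.0 * 0.4 * x * (x - 0.04) ** (-0.6)

if __name__ == "__main__":
    for x in (0.05, 0.1, 0.3, 1.0, 3.0):
        print(x, cumulative_tail(x), local_loglog_slope(x))
```

Because the local slope changes continuously with |Δr|, a straight line fitted over one decade would give a different exponent from a line fitted over the next, which is the non-universality warned about in the text.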

We observed the daily change of the distributions of rate fluctuations per tick and found
that the distributions are always quite symmetric and that the functional forms are always
similar to Eq. (2); however, the standard deviation can change by more than a factor of
ten, from 0.03 up to 0.36. This phenomenon is related to the problem of long-time
correlation in volatility fluctuations observed in any open market price [3]. We can easily
confirm that volatility, measured by the absolute values of the rate changes, has a long
correlation; however, the details of its functional form are yet to be clarified.

Figure 5. Distribution of the time interval between two successive trades in log-log scale.
The dotted line shows a power law with exponent -1.8.

The time measured in ticks is obviously different from physical time. The
occurrence of trades is also a stochastic process, and we observe the distribution of tick
intervals. In Fig. 5 the cumulative distribution of tick intervals measured in seconds is
plotted in log-log scale. Here, the minimal tick interval is 0 seconds and the largest is
about 5600 seconds; intervals longer than this are neglected because they include
weekends or holidays. The scale range is therefore limited; however, we can find a power
law tail in the scale range [30, 1600] seconds.

These statistical properties are not limited to yen-dollar rates; they are applicable to
other major currency exchanges. Similarities can be extended to stock markets or other
open markets to some extent; however, in such markets trading volumes are much smaller
than in currency exchange. The total money flow in the foreign currency market in one day is
estimated to be more than $2\times10^{14}$ yen, which is roughly a half of the annual gross
domestic product (GDP) of Japan. Markets other than foreign exchange are open only in office
hours, so their statistics are expected to be less universal than those of currency markets.
For example, currency market fluctuations are generally symmetric, whereas in stock
markets asymmetric properties such as crashes sometimes appear. We believe that
universal properties can be more clearly observed in the foreign currency exchange
markets than in any other markets.

The fat tail properties in the rate changes can be explained by introducing a simple
mathematical model. It is deduced from the analysis of a numerical market model that the
market price changes can be approximately described by a Langevin equation with a
fluctuating coefficient [9],

$$\Delta r(t + \Delta t) = b(t)\,\Delta r(t) + f(t) \tag{3}$$

Here, the time $t$ can be either tick time or physical time, and $f(t)$ and $b(t)$ represent
the random force term and the random coefficient, respectively. The random force term comes
from the scatter of dealers' characters and predictions. The value of $b(t)$ is larger when
the dealers' averaged response to the latest price change is larger. It is known that the
statistical properties of $\Delta r(t)$ are fully characterized by the statistics of $b(t)$
[10]. In the case that $b(t)$ is always smaller than 1, the distribution of $\Delta r(t)$ is
known to follow a stretched exponential form. If the probability of $b(t)$ taking a value
larger than 1 is not zero, the distribution of $\Delta r(t)$ has power law tails. The exponent
of the power law, $\beta$, is given by a simple formula,

$$P(>|\Delta r|) \propto |\Delta r|^{-\beta} \quad \text{with} \quad \langle b(t)^{\beta} \rangle = 1 \tag{4}$$

where $\langle \dots \rangle$ denotes the ensemble average. Eq.(4) is mathematically rigorous
in the range $0 < \beta < 2$. When large values appear successively in $b(t)$, $\Delta r$ is
enhanced successively by the recursion of Eq.(3), resulting in a large value of $\Delta r$.
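
The recursion in Eq.(3) and the exponent condition in Eq.(4) can be tried out numerically.
The following Python sketch is illustrative only: it assumes a log-normally distributed
positive coefficient b(t) and a Gaussian random force f(t), neither of which is specified in
the text, and it solves the condition <b^β> = 1 by bisection. For a log-normal coefficient
b = exp(μ + σz) the condition has the closed form β = -2μ/σ², which serves as a check. All
names and parameter values are hypothetical.

```python
import numpy as np

def simulate_langevin(b, f):
    """Iterate Eq. (3): dr[t+1] = b[t] * dr[t] + f[t], starting from dr[0] = 0."""
    dr = np.zeros(len(b) + 1)
    for t in range(len(b)):
        dr[t + 1] = b[t] * dr[t] + f[t]
    return dr

def tail_exponent(b_samples, lo=1e-3, hi=10.0, iters=60):
    """Solve <|b|^beta> = 1 for beta > 0 by bisection (condition in Eq. (4))."""
    abs_b = np.abs(b_samples)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if np.mean(abs_b ** mid) < 1.0:
            lo = mid          # moment still below 1: the root lies at larger beta
        else:
            hi = mid
    return 0.5 * (lo + hi)

rng = np.random.default_rng(0)
mu, sigma = -0.1875, 0.5                       # assumed values; closed form gives beta = 1.5
b = np.exp(mu + sigma * rng.normal(size=200_000))
f = rng.normal(size=b.size)                    # assumed Gaussian random force

dr = simulate_langevin(b, f)
beta_hat = tail_exponent(b)
print("beta from sampled b:", round(beta_hat, 3), " closed form:", -2 * mu / sigma**2)
```

With μ = -0.1875 and σ = 0.5 the closed form gives β = 1.5, inside the range 0 < β < 2 where
Eq.(4) is stated to be rigorous.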

An interesting point in this discussion is that the Langevin equation is widely used for
physical systems; therefore, a situation similar to that of an open market can be realized
in purely physical systems. For example, one of the authors (H.T.) and his collaborators
are now developing an electrical circuit that has a randomly fluctuating resistance. Note
that the case $b(t) > 1$ corresponds to a negative resistance, that is, an amplification,
in the continuum limit of Eq.(3). According to our preliminary results, the statistics of
its voltage fluctuations caused by thermal noise show similarity with those of price changes
or rate changes in open markets [11].

The model equation, Eq.(3), captures a basic aspect of price changes in an open market;
however, it is obviously too simple. Two of the authors (H.T. and M.T.) have already
derived a more general basic equation of price changes by theoretical considerations of
the dynamics of dealers [5]. Skipping the derivation, the general equation is a set of linear
equations with random force and random coefficients,

$$r(t + \Delta t) = r(t) + a(t)\,\bigl(r^*(t) - r(t)\bigr) \tag{5a}$$

$$r^*(t + \Delta t) = r^*(t) + f(t) + b(t)\,\bigl(r(t) - r(t - \Delta t)\bigr) \tag{5b}$$

where $r^*(t)$ is a virtual equilibrium price that is determined by the balance of demand
and supply when all dealers openly show the buying and selling prices they have in mind.
The coefficient $a(t)$ gives the market price's response to a change of demand and supply;
it is inversely proportional to the price rigidity in economics, namely, for larger $a(t)$
the market price changes more for the same modification of the demand-supply configuration.
For given $\{a(t)\}$ and $\{b(t)\}$ with initial conditions for $r(t)$ and $r^*(t)$, Eqs.(5a)
and (5b) determine the time evolution of $r(t)$ and $r^*(t)$ simultaneously. In the special
limit that $r(t)$ is always nearly equal to $r^*(t)$, this set of equations becomes identical
to Eq.(3), and the basic properties of Eq.(3) are recovered.

There are 3 typical behaviors in the time evolution of Eqs.(5). One is a stable state in
which the deviation between $r(t)$ and $r^*(t)$ does not increase on average, which is
realized when both $a(t)$ and $b(t)$ are smaller than 1. Another behavior is an exponential
growth of $r(t)$, which corresponds to the phenomenon called a "bubble". This behavior occurs
when $b(t)$ is larger than 1 and $a(t)$ is smaller than 1, namely when the dealers strongly
expect the latest trend to continue and the price rigidity is rather high. In such a case
the expected equilibrium price $r^*(t)$ goes ahead while $r(t)$ follows it, and the deviation
between $r(t)$ and $r^*(t)$ increases exponentially. The third behavior is an oscillatory
fluctuation that happens when $a(t)$ is larger than 1. In such a case the price rigidity is so
small that the market price overshoots $r^*(t)$, causing an oscillation.
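
The three regimes can be reproduced by iterating the two update rules of Eqs.(5) directly.
The sketch below is a simplification, not the authors' simulation: it assumes constant
coefficients a and b, switches the random force off (f = 0), uses a unit time step, and
starts from an arbitrary unit gap between r* and r. All names and parameter values are
illustrative.

```python
import numpy as np

def simulate_eq5(a, b, steps=60, kick=1.0):
    """Iterate Eqs. (5a)-(5b) with constant a, b, time step 1 and f = 0 (assumptions)."""
    r = np.zeros(steps + 1)
    r_star = np.zeros(steps + 1)
    r_star[0] = kick                 # initial gap between r* and r (illustrative)
    r_prev = 0.0                     # value of r one step before the start
    for t in range(steps):
        r[t + 1] = r[t] + a * (r_star[t] - r[t])          # Eq. (5a)
        r_star[t + 1] = r_star[t] + b * (r[t] - r_prev)   # Eq. (5b) with f = 0
        r_prev = r[t]
    return r, r_star

def sign_changes(values):
    """Number of times consecutive entries differ in sign."""
    return int(np.sum(np.diff(np.sign(values)) != 0))

for label, a, b in [("stable", 0.5, 0.5), ("bubble", 0.5, 1.5), ("oscillation", 1.5, 0.5)]:
    r, r_star = simulate_eq5(a, b)
    gap = r_star - r
    print(f"{label:12s} final |r* - r| = {abs(gap[-1]):.3g}, sign changes = {sign_changes(gap)}")
```

In this toy setting the gap r* - r decays for a, b < 1, grows without changing sign for
b > 1 and a < 1, and alternates in sign for a > 1, which mirrors the three behaviors
described in the text.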

There are two obvious defects in the simplest model equation, Eq.(3): the lack of long
time correlation in volatility and the absence of characteristic behaviors such as bubbles
and oscillations, which are observable in real open market data. We are now studying the
basic properties of the generalized version, Eqs.(5), in comparison with real data, so that
the values of the coefficients $a(t)$ and $b(t)$ can be estimated from the real time
sequences of $r(t)$.


4  Company’s  macroscopic  financial  data  analysis 

The history of the study of fractal properties in money flow can be traced back more than
100 years. In 1896 Pareto investigated personal incomes in Italy and found that the
distributions are approximated by a power law [12]. In 1922 Gini explored the same
quantity for several European countries and reported that the exponents of the power laws
differ from country to country [13]. Two physicists, Montroll and Shlesinger, paid
attention to the power law nature of income distribution in 1983 and found that the personal
income of the top 1% follows a power law, while that of the remaining 99%, who are mostly
salaried, is characterized by a log-normal law [14]. Recently, fractal properties of
companies' income and assets have been attracting much interest.

M. H. R. Stanley et al. established an interesting scaling relation for the statistics of
the growth of companies [15]. Let $A(t)$ be the asset of a company in the $t$-th year, where
asset means intuitively the price of the company as a whole. Asset growth is characterized by
the growth rate, defined as the logarithm of the ratio of assets in successive years,

$$R(t) = \log \frac{A(t)}{A(t-1)} \tag{6}$$

The distribution function of $R(t)$ has been observed for more than 10,000 companies in
the USA over about ten years. It is shown that the probability density of $R(t)$ for a
company with asset $A$ is approximately given by an exponential function

$$p(R|A) = \frac{1}{\sqrt{2}\,\sigma(A)} \exp\left(-\frac{\sqrt{2}\,|R - \bar{R}|}{\sigma(A)}\right) \tag{7}$$

where $\bar{R}$ is the mean growth rate and $\sigma(A)$ is the standard deviation of $R$ for
asset sizes around $A(t) = A$. There exists a non-trivial scaling relation for $\sigma(A)$,

$$\sigma(A) \propto A^{-\theta}, \quad \theta \approx 0.15 \tag{8}$$

Namely, for companies with larger assets the fluctuations are relatively smaller, which is
intuitively very natural.

There are several quantities that characterize the size of companies, such as the number
of employees and net sales, and it is confirmed that characterization using these
quantities also leads to the same result. This type of scaling relation is also known to hold
not only in the USA but also in other countries, for example, Japan, France and Italy [16].

If all of the components of a company fluctuate independently, then it is easy to show
that the value of the exponent $\theta$ should be 0.5, while if the growth is merely
proportional to the whole asset then $\theta$ should be 0. Therefore, the non-trivial value of
$\theta$ strongly suggests that company growth is governed by non-trivial interactions of
either internal [17] or external [16] origin.
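
The two limiting values of the exponent can be illustrated with a toy calculation. The
sketch below is an assumption-laden illustration, not a model from the text: a company is
represented as n equal units, each unit receives a Gaussian relative shock, and the shocks
are either drawn independently per unit or shared by all units. The spread of the
logarithmic growth rate is then fitted against n on a log-log scale. Function names, the
shock size and the list of company sizes are all hypothetical.

```python
import numpy as np

def growth_rate_std(n_units, independent, n_samples=5000, unit_sigma=0.1, seed=0):
    """Standard deviation of log growth for a company made of n_units equal parts."""
    rng = np.random.default_rng(seed)
    if independent:
        shocks = rng.normal(0.0, unit_sigma, size=(n_samples, n_units))
    else:
        # one common shock per sample, copied to every unit (proportional growth)
        shocks = np.repeat(rng.normal(0.0, unit_sigma, size=(n_samples, 1)), n_units, axis=1)
    growth = np.log(1.0 + shocks.mean(axis=1))   # log of (asset after) / (asset before)
    return growth.std()

def fitted_theta(independent, sizes=(4, 16, 64, 256)):
    """Exponent theta in sigma(A) ~ A^(-theta), with company size A measured in units."""
    sigmas = [growth_rate_std(n, independent) for n in sizes]
    slope, _ = np.polyfit(np.log(sizes), np.log(sigmas), 1)
    return -slope

theta_independent = fitted_theta(True)
theta_common = fitted_theta(False)
print("independent units:", round(theta_independent, 3))
print("common shock:     ", round(theta_common, 3))
```

The independent case reproduces the familiar averaging of n uncorrelated terms, and the
common-shock case leaves the relative fluctuation unchanged with size. The observed value
lies between these two limits.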

The universality of the scaling relation, Eq.(8), is still wider. It is known to be valid
also for the growth of the gross domestic product (GDP) of more than 150 countries over
periods of more than 40 years after World War II [18]. Although the detailed mechanisms
of these phenomena are yet to be clarified, it is likely that gigantic economic systems
fall in the same universality class, in which a non-trivial fractal relation holds.

Very recently two of the authors (H.T. and M.T.) together with Okuyama examined
detailed data of annual incomes of companies in Japan which had been reported to the tax
office, and they found that a clear power law with exponent very close to -1 holds (see
Fig.6) [19]. The data cover all Japanese companies whose incomes are larger than
$4.0\times10^{7}$ yen, and the number of such companies is about 85,000. Here, income is
roughly the total sales minus the total outgoing costs. There are fluctuations for
very large incomes; however, for companies having income less than $10^{10}$ yen the points
lie clearly on a straight line in the log-log plot. Such power law behaviors can also be
confirmed for other years and for several other countries.
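
How an exponent of this kind can be read off a sample is sketched below. The data are
synthetic, not the tax-office data: a Pareto sample with exponent 1 is generated above a
threshold of 40 (mirroring the lower cutoff in units of million yen), and the exponent is
estimated in two ways, by the Hill estimator and by a straight-line fit to the cumulative
distribution on a log-log scale. Neither estimator is claimed to be the one used by the
authors, and the function names are illustrative.

```python
import numpy as np

def hill_exponent(x, x_min):
    """Hill (maximum-likelihood) estimate of the cumulative power-law exponent."""
    tail = np.asarray(x, dtype=float)
    tail = tail[tail >= x_min]
    return len(tail) / np.sum(np.log(tail / x_min))

def ccdf_slope(x, x_min, x_max):
    """Slope of log P(>= x) against log x, fitted for x_min <= x <= x_max."""
    xs = np.sort(np.asarray(x, dtype=float))[::-1]     # descending order
    ccdf = np.arange(1, len(xs) + 1) / len(xs)          # fraction of samples >= xs[k]
    mask = (xs >= x_min) & (xs <= x_max)
    slope, _ = np.polyfit(np.log(xs[mask]), np.log(ccdf[mask]), 1)
    return slope

rng = np.random.default_rng(0)
# Synthetic Pareto sample: P(>= x) = (40 / x)^1 for x >= 40 (assumed, for illustration)
incomes = 40.0 * (1.0 - rng.random(85_000)) ** (-1.0)

alpha_hill = hill_exponent(incomes, x_min=40.0)
slope_fit = ccdf_slope(incomes, x_min=40.0, x_max=1.0e4)
print("Hill exponent:", round(alpha_hill, 3), " log-log slope:", round(slope_fit, 3))
```

Restricting the straight-line fit to a range below the largest values follows the remark in
the text that the very large incomes fluctuate.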


Figure 6. Log-log plot of the income distribution of Japanese companies for all job categories.
Income x is represented in the unit of million yen.

Figure 7. Income distribution for the category "constructions".

Figure 8. Income distribution for the category "banks".

An interesting feature of this power law distribution is that a similar power law holds in
each category of industry. For example, Fig.7 shows the same plot as in Fig.6 but with the
companies limited to those categorized in "constructions". The number of
companies is about 11,000 and the estimated power exponent is -1.1, very close to the
case of all categories.

By carefully observing the distributions of income in each job category, we find that
there are some exceptions that do not follow the power law. For example, in Fig.8 we
show the distribution for the category "banks". Here, we cannot find a clear slope
in the plot. The existence of a bending point around $10^{2}$ million yen implies that the
number of banks whose incomes are less than this amount is very small. In Japan there were
many legal regulations for banks; for example, the interest rates were standardized all over
the country, which caused little competition among banks, and a new bank could not be
established easily. As symbolized in this example, deviation from power law distributions
tends to be observed in job categories which are less competitive than others.
Recently, Japanese bank regulations have been loosened drastically, so the
future change in the distribution of income of banks should attract the attention not only
of economists but also of scientists.

What is the relation between asset and income, given that both characterize a company's
statistics? To be precise, the income defined above includes taxes to be paid to the
government, which amount to about 50% independent of the asset or income size in Japan.
The rest of the income is normally added to the company's asset, namely, the following
relation holds as a rough estimate.


$$A(t+1) = A(t) + I(t)/2 \tag{9}$$

where $I(t)$ denotes the income of the $t$-th year. It is confirmed from the data that $I(t)$
is generally about 2 orders of magnitude smaller than $A(t)$ and that it can take a negative
value, while the company collapses when $A(t+1)$ becomes negative. Comparing Eq.(9) with
Eq.(6) we have a relation between $R(t)$ and $I(t)$ as

$$R(t+1) = \log\left(1 + \frac{I(t)}{2A(t)}\right) \approx \frac{I(t)}{2A(t)} \tag{10}$$

The relation between income and asset can be examined directly by plotting these two
quantities against each other. The scatter is generally large, but the following nonlinear
relation has been found on average [19].

$$I(t) \propto A(t)^{\delta}, \quad \delta = 0.85 \tag{11}$$

These relations (10) and (11) are consistent with Eq.(8): since the growth rate scales as
$I/A \propto A^{\delta - 1}$, we confirm the relation between the exponents,
$\delta = 1 - \theta$. This nonlinear relation implies that large systems are
less efficient, which is intuitively well appreciated.

So far no mathematical model is known that reproduces all of these empirical relations
of companies' financial data. Here, we introduce a simple mathematical model based on
competitive growth that realizes the power law distribution with exponent -1. Let us
consider a 2-dimensional lattice and regard each site as a source of unit income.
Initially all the sites on the top line are occupied by different companies; therefore, the
incomes of all companies are 1. In the second step, a company on the top line is chosen
randomly and it occupies a site on the second line; this process is repeated until all sites
on the second line are occupied. In the third step a company is chosen randomly from the
companies on the second line with probability weight proportional to the total number
of sites it occupies on the top and second lines, and it occupies a randomly chosen site on
the third line. This procedure is repeated until all sites on the third line are occupied.
The same growth process is repeated for many lines (see Fig.9). Note that if a company holds
no site on the k-th line, then the company stops growing.

Figure 9. The company growth model based on competitive growth.

Figure 10. Log-log plot of income distribution by the growth model.

In this model each new occupation can be regarded as a unit investment that succeeds.
This growth process is based on the intuition that a company's income grows with
probability proportional to investment, and the amount of investment is assumed to be
either 0 or proportional to the whole income. The result of this numerical simulation is
shown in Fig.10. We can find a clear power law with exponent -1, which is consistent with
Fig.6. Of course this model captures just one aspect of the phenomenon, and we need to
develop a model that explains both the asset and income statistics.
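
The growth rule described above can be written down compactly. The following Python sketch
is one possible reading of the verbal description, not the authors' code: the sites of each
new line are assigned in a single multinomial draw, so lattice position plays no role, and
a company that holds no site on the latest line receives zero weight from then on. The
lattice width, the number of lines and all names are illustrative.

```python
import numpy as np

def competitive_growth(width=1000, n_lines=50, seed=0):
    """Return the total number of sites (income) held by each initial company."""
    rng = np.random.default_rng(seed)
    income = np.ones(width, dtype=np.int64)        # top line: one site per company
    on_last_line = np.ones(width, dtype=bool)      # companies present on the latest line
    for _ in range(n_lines - 1):
        # weight = total sites so far, but only for companies on the latest line
        weights = np.where(on_last_line, income, 0).astype(float)
        counts = rng.multinomial(width, weights / weights.sum())
        income += counts
        on_last_line = counts > 0                  # the others stop growing
    return income

income = competitive_growth()
survivors = int(np.sum(income > income.min()))
print("companies:", income.size, " total sites:", int(income.sum()),
      " largest income:", int(income.max()), " companies above minimum:", survivors)
```

The resulting `income` array can be passed to any cumulative-distribution plot to compare
with the exponent reported for the model. No particular exponent is asserted for this
simplified version.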
5 Discussions

We have introduced three topics in this paper: the balance of demand and supply, price
fluctuations in open markets, and companies' macroscopic financial data analysis. These
topics may look very different superficially; however, they are deeply interconnected by
the key word "fluctuations". Fluctuations had been ignored in economics, especially in
the discussion of demand and supply; however, as we reviewed in Section 2, fluctuations
become very large and quick at the balanced point of demand
and supply. From the viewpoint of statistical physics the balanced point is the critical
point of the underlying phase transition and the fluctuations tend to show fractal
characteristics, namely, the averaged amplitude of fluctuations can be infinite in a
theoretically ideal situation.

Economic systems tend to be around the critical point in general. Open markets are
typical examples. When there are more buyers than sellers, the market price goes up,
causing a decrease in the number of buyers and an increase in sellers. Such a well-known
pulling back mechanism obviously keeps the system around the critical point. The
important fact is that this criticality does not stabilize the absolute value of the market
price but only stabilizes the statistics of market price fluctuations to follow the critical
fluctuations. Since the dealers in open markets tend to care only about the relative market
price, whether it goes up or down, the absolute value of the market price is nearly
meaningless for the determination of the market price. Therefore, market prices
generally wander almost randomly, with statistics characterized by the critical point.

From the viewpoint of fluctuations in demand and supply, companies' statistics may
also be considered in a similar way. When a new demand appears in a new field of
industry, companies competitively try to supply it. The winner will get a large
income and the company will grow. The followers will share the rest of the demand and the
total supply will increase. When the total supply becomes close to the demand, some
companies cannot gain income and the growth of supply is weakened. As a result
demand and supply may nearly balance on average and the whole system may show
critical characteristics such as fractal distributions.

The traditional economic theory of demand and supply entirely neglects the effect of
fluctuations, while modern financial technology is based on a continuous
independent Gaussian process. From the standpoint of statistical physics these two cases
are extreme limits which correspond to the cases of absolute zero and infinite
temperature, respectively. However, interesting phenomena such as phase transition and
pattern formation can never happen in such situations in the case of physical systems. As
we have shown in this paper, we can find critical behaviors in real economic data which
considerably resemble phase transition phenomena in physical systems. It is very likely
that there are many more examples hidden in economic data which possess fractal properties,
because the balance of demand and supply automatically attracts any economic system
toward the critical point. Data analysis methods and theoretical formulations
developed for phase transition phenomena in physics will become more and more
important in economics in the near future.
In the framework of the Comprehensive Test Ban Treaty, development of reliable
methods to discriminate between underground nuclear explosions and earthquakes
at regional distances (less than 2500 km) continues to be very important, especially
in connection with the most recent (May 1998) nuclear explosions conducted at the Indian
and Pakistani test sites. Since the lithosphere is a fractal, we suppose that the signals
which propagate through the medium inherit its 'self-similar' (scaling) features. We
assumed that these features of explosions and earthquakes, or of their topological
reconstructions (embeddings), have to be different. Scaling reflects correlations of
higher order than can be estimated by linear discriminating methods and
can be used as a basis for non-linear discrimination. We propose to build a universal
geometrical model of a seismic signal using the canonical algorithm of F. Takens and
to estimate the scaling of the model. The scaling features were used as patterns of
seismic signals for input to an artificial neural network. Records of
nuclear explosions and earthquakes from different regions were included in the
training set. The net was trained to classify types of seismic events. Results
have shown 89% correct classification of the unknown signals. As additional tools
for distinguishing between nuclear explosions and earthquakes we propose to use
Hurst's method and the cross correlation method. Results of using these methods
are demonstrated on examples of some explosions and earthquakes.


1  Introduction 


The nuclear explosion discrimination problem continues to be very important, especially
in connection with the most recent (May 1998) nuclear explosions conducted at the
Indian and Pakistani test sites. Existing regional methods of seismic event
discrimination 1,2,3,4,5 are based on the comparative analysis of spectral characteristics of
the two main components (P-wave and S-wave) of a seismogram (see Fig. 1). However,
these parameters are very sensitive to non-uniformity of the lithosphere and
the asthenosphere and depend on the location of the event and the path of signal
propagation. Moreover, modern technology of nuclear testing complicates
distinguishing between nuclear explosions and earthquakes.

Figure 1. A seismogram of the Chinese earthquake, 24.07.86

It is known that the lithosphere exhibits fractal features 6,7 in a wide range of
scales, from fractions of a millimetre for geological grains to tens of thousands of
kilometres for tectonic plates, and has a hierarchy of self-similar blocks. The response of
the fractal lithosphere depends on the source geometry (radiation pattern) of a seismic
event more than on its yield. Moreover, an earthquake centre forms in the
medium over a long time. Features of the medium have changed by the moment of
the earthquake, which occurs in a pre-stressed medium. An explosion is an
artifact for the medium and is not connected with any preliminary changes of external
parameters of the medium. Since the dynamic scenarios of explosions and earthquakes
are different, we assumed that scaling features of seismic signals or of their topological
reconstructions (embeddings) have to be different. Scaling reflects correlations of
higher order than can be estimated by linear discriminating methods, and
can be used as a basis for non-linear discrimination. We propose to build a universal
geometrical model of a seismic signal using the canonical Takens algorithm 8,9 and to
use its scaling features as attributes for seismic event discrimination.

The structure of this paper is as follows. In Sec. 2 we study 1D scaling features
of seismograms. In Sec. 3 we study the correlation dimension of reconstructions of
explosions and earthquakes. In Sec. 4 we use the scaling features of embeddings
of seismic signals as patterns for a neural network. In Sec. 5 we describe the
cross correlation method as a tool for seismic discrimination. Our findings are
summarized in the conclusion section.

2 Hurst's exponents of seismograms

We started from the study of self-similar properties of the processes in seismic
sources, applying Hurst's method 10 to the seismograms of nuclear explosions and
earthquakes. Let us recall the classical notion of self-similarity for a random
process 11. $X(t)$, $t \in \mathbb{R}$, is a self-similar process if there exists a family of
positive real numbers $c_r$ such that, for any $r > 0$,

$$X(t) \stackrel{d}{=} c_r X(rt), \quad t \in \mathbb{R} \tag{1}$$

where $\stackrel{d}{=}$ denotes equality of all finite-dimensional distributions. This
equation is a statement of invariance of $X(t)$ under the group of affine transformations
$X \to c_r X$, $t \to rt$, $c_r > 0$. Since $c_{ab} = c_a c_b$ and $c_1 = 1$, $c_r$ must have
the form $r^{-H}$ for some $H > 0$, and the formula can be rewritten as

$$X(t) \stackrel{d}{=} r^{-H} X(rt), \quad t \in \mathbb{R} \tag{2}$$

where $H$ is the Hurst exponent. Traditionally it is estimated by the rescaled range
method 10.
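
A minimal sketch of a rescaled range (R/S) estimate is given below. It is a textbook-style
variant written for illustration, not necessarily the procedure applied to the seismograms:
the series is cut into non-overlapping windows of doubling length, the range of the
mean-adjusted cumulative sum is divided by the standard deviation in each window, and the
slope of log(R/S) against log(window length) is taken as H. The function name and the
window choices are assumptions.

```python
import numpy as np

def hurst_rs(series, min_window=8):
    """Estimate the Hurst exponent by the rescaled range method."""
    x = np.asarray(series, dtype=float)
    n = len(x)
    log_w, log_rs = [], []
    w = min_window
    while w <= n // 2:
        k = n // w
        chunks = x[:k * w].reshape(k, w)                     # k windows of length w
        dev = chunks - chunks.mean(axis=1, keepdims=True)    # mean-adjusted values
        cum = np.cumsum(dev, axis=1)                         # cumulative deviations
        r = cum.max(axis=1) - cum.min(axis=1)                # range R
        s = chunks.std(axis=1)                               # standard deviation S
        valid = s > 0
        if valid.any():
            log_w.append(np.log(w))
            log_rs.append(np.log(np.mean(r[valid] / s[valid])))
        w *= 2
    slope, _ = np.polyfit(log_w, log_rs, 1)
    return slope

rng = np.random.default_rng(1)
noise = rng.normal(size=4096)          # uncorrelated test signal
walk = np.cumsum(noise)                # strongly persistent test signal
h_noise = hurst_rs(noise)
h_walk = hurst_rs(walk)
print("white noise H:", round(h_noise, 2), " random walk H:", round(h_walk, 2))
```

For uncorrelated noise the estimate lies near 0.5 (with a known upward bias for short
windows), while a smooth, persistent record gives values near 1, the value reported above
for the more regular earthquake records.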

We analyzed seismograms of the nuclear explosions conducted in India and Pakistan
and of Tibetan earthquakes. Fig. 2 shows the Hurst exponents for different
types of seismic events. Analysis of the Hurst exponents allows us to conclude that
the graphs of the earthquakes are more regular than the graphs of the explosions, because
$H$ estimated for earthquakes equals 1 on a larger range of scales (i.e. $\log T \in [0, 1.5]$)
than for explosions (i.e. $\log T \in [0, 1]$).

It is necessary to note that the procedure described above is suitable only for
records with a low level of noise.

3  Correlation  integral  as  a  tool  for  distinguishing  between  different 
seismic  sources 

The estimation of the correlation dimension 12 of attractors reconstructed directly
from experimental time series is an often-used means of gaining information about
the nature of the underlying dynamics. It is assumed that a scalar time series
$y_i$, $i = 1, 2, \dots, N$, has been generated by a smooth dynamical system
$x(t + \tau) = f_\tau(x(t))$ defined on a manifold $M$ with an attractor $A \subset M$ such
that $y_i = h(x(i\tau))$, where $h : M \to \mathbb{R}$ is a Lipschitz function. Then the
reconstruction

$$y(i) = \{y_i, y_{i-1}, \dots, y_{i-(m-1)}\} = \{h(x(t)), h(x(t - \tau)), h(x(t - 2\tau)), \dots, h(x(t - (m-1)\tau))\}, \quad t = i\tau \tag{3}$$

defines a delay-coordinate map $\Lambda : M \to \mathbb{R}^m$, and the image of $A$ under this
map is the reconstructed attractor. Takens' theorem 8,9 ensures that if $m > 2D$, where $D$ is
the box-counting dimension (or capacity) of $A$, then the map $\Lambda$ is an embedding, i.e.
it is one-to-one and also an immersion, up to an assumption about prevalence 13.
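
The delay-coordinate reconstruction amounts to stacking shifted copies of the scalar
series. A minimal sketch follows; the function name and the `lag` parameter (the delay
measured in samples) are illustrative, and the coordinates within each vector are ordered
forward in time, which is a convention and does not affect distances between vectors.

```python
import numpy as np

def delay_embed(y, m, lag=1):
    """Return an array whose i-th row is (y[i], y[i + lag], ..., y[i + (m - 1) * lag])."""
    y = np.asarray(y, dtype=float)
    n_vectors = len(y) - (m - 1) * lag
    if n_vectors <= 0:
        raise ValueError("series too short for this embedding dimension and lag")
    return np.column_stack([y[k * lag : k * lag + n_vectors] for k in range(m)])

# Example on a short synthetic record (not seismic data)
signal = np.sin(0.3 * np.arange(200))
vectors = delay_embed(signal, m=3, lag=2)
print(vectors.shape)   # number of reconstructed points, embedding dimension
```

Each row of `vectors` is one point of the reconstruction, so a series of N samples yields
N - (m - 1)·lag points in an m-dimensional space.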

For Takens' theorem to be valid, we need to assume that both the dynamics and
the observations are autonomous (so that $f$ and $h$ depend on $x$ only). Unfortunately,
these assumptions fail to hold for seismic sources. However, there exists a Takens
embedding theorem for forced and stochastic systems 14. We applied the reconstruction
technique described above on the basis of this latter theorem, assuming that
the seismograms are typical and contain all the characteristic scales of the dynamics of
the seismic sources. This means that in some sense the attractor of the seismic source
exists.

Figure 2. Hurst exponent H curves for different seismic events: Pak1 - nuclear explosion 30.05.98
(Pakistan), ind - nuclear explosion 11.05.98 (India), tib1 - earthquake (Tibet), tib2 - earthquake
(Tibet), tib3 - earthquake (Tibet)

We reconstructed some old non-camouflaged nuclear explosions and found
that their embeddings in $\mathbb{R}^2$ visually differ from those of earthquakes (see
Figures 3-4). Unfortunately, this difference is not visible for records of the most recent
explosions (the Indian and Pakistani explosions; see, for example, Fig. 5). Therefore, it is
reasonable to use numerical characteristics of these reconstructions. The most popular tool
for description of the embedding obtained is the correlation integral 12.

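Let $y(i) \in \mathbb{R}^m$ be a point on the reconstructed attractor. The correlation
integral is defined as the proportion of pairs of points no more than a distance
$\varepsilon$ apart. That is,

$$C_m(\varepsilon) = \frac{1}{N(N-1)} \sum_{j} \sum_{k \neq j} \Theta\bigl(\varepsilon - \|y(j) - y(k)\|\bigr) \tag{4}$$

Here $\Theta$ is the Heaviside step function, and the symbol $\|\cdot\|$ always denotes the
'sup' norm on $\mathbb{R}^m$.

There is scaling:

$$C_m(\varepsilon) \propto \varepsilon^{\nu} \tag{5}$$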
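
The correlation integral and its scaling exponent can be computed directly from a set of
reconstructed points. The sketch below is a brute-force illustration using the sup norm,
suitable only for a few thousand points; the function names are hypothetical, the test data
are synthetic, and the range of ε values must be chosen so that every value of the integral
is positive before the logarithm is taken.

```python
import numpy as np

def correlation_integral(points, eps_values):
    """Fraction of distinct pairs whose sup-norm distance does not exceed each eps."""
    pts = np.asarray(points, dtype=float)
    n = len(pts)
    # sup-norm distance between every pair of points
    dist = np.abs(pts[:, None, :] - pts[None, :, :]).max(axis=2)
    pair_dist = dist[np.triu_indices(n, k=1)]      # each unordered pair once
    return np.array([(pair_dist <= e).mean() for e in eps_values])

def correlation_exponent(points, eps_values):
    """Slope of log C(eps) against log eps over the supplied eps range."""
    c = correlation_integral(points, eps_values)
    slope, _ = np.polyfit(np.log(eps_values), np.log(c), 1)
    return slope

rng = np.random.default_rng(0)
eps = np.geomspace(0.02, 0.1, 6)
square = rng.random((800, 2))                      # points filling a unit square
u = rng.random(800)
line = np.column_stack([u, u])                     # points on a line segment in the plane

nu_square = correlation_exponent(square, eps)
nu_line = correlation_exponent(line, eps)
print("square:", round(nu_square, 2), " line:", round(nu_line, 2))
```

For these test sets the fitted exponent comes out close to the dimension of the set (2 for
the square, 1 for the line). For real records the local slope may vary with ε, in which case
a single fitted number is not meaningful.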
of underground explosions the main part of seismic energy is contained in compressional
P-waves having high frequency content in comparison with earthquakes. At
the same time earthquakes differ by generating intensive shear S-waves. These
differences are traditionally used for choosing diagnostic parameters for seismic event
discrimination. Hence, we studied the signals for both types of waves separately.

We processed records of underground nuclear explosions conducted at the Semipalatinsk
Test Site (STS, Kazakhstan) and Lop Nor (China) and of earthquakes in
China and Altay (Russia). The seismograms were recorded by the Kazakhstani seismic
stations BRVK, VOS, ZRN and CHK, located in the North-West of Kazakhstan, and TLG,
located in the South. In addition, seismograms from the NIL station (Pakistan) were
processed.

Our experience showed that correlation integrals for P-wave reconstructions
are more informative than those for S-waves: analyzing correlation integrals for
S-waves, we did not find notable features discriminating different seismic sources.
Figs. 6 and 7 show typical correlation integrals for P-waves of a nuclear explosion and an
earthquake. It is seen that the correlation dimensions calculated for different types
of seismic events are different. As a rule, the correlation dimensions are higher for
explosions than for earthquakes.


Figure 6. Slopes of correlation integrals (m = 8 - 16, τ = 2), P-wave of the nuclear explosion,
Semipalatinsk Test Site, 25.04.82.

Figure 7. Slopes of correlation integrals (m = 7 - 14, τ = 2), P-wave of the Chinese
earthquake, 24.07.86.


In the general case the slopes exhibit complex behavior of $\nu$ versus
$\varepsilon$ (Fig. 8). For complex systems it is possible that more than one
scaling regime exists for different $\varepsilon$. The complexity of the seismic
record slopes might be explained by the presence of different kinds of folding
effects 17. For the reasons outlined above we used the form of the slope curve
instead of the value of the correlation dimension, i.e. the function
$\nu = \nu(\log \varepsilon)$.

We used this function as the discriminating attribute of seismic signals for
training an artificial neural network to classify seismic sources.
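
The slope curve $\nu(\log \varepsilon)$ described above can be sketched numerically as follows. This is only an illustrative sketch, not the authors' code: it assumes a standard delay embedding with dimension `m` and delay `tau`, the sup norm, a correlation sum normalized by the number of distinct pairs, and a finite-difference estimate of the local slope. All function names are hypothetical.

```python
import numpy as np

def delay_embed(x, m, tau):
    """Delay vectors y(i) = (x[i], x[i+tau], ..., x[i+(m-1)*tau])."""
    n = len(x) - (m - 1) * tau
    return np.column_stack([x[k * tau : k * tau + n] for k in range(m)])

def correlation_sum(y, eps):
    """Fraction of distinct pairs of delay vectors closer than eps (sup norm)."""
    d = np.max(np.abs(y[:, None, :] - y[None, :, :]), axis=2)
    upper = np.triu_indices(len(y), k=1)  # each pair counted once, no self-pairs
    return np.mean(d[upper] < eps)

def slope_curve(x, m, tau, eps_values):
    """Return log(eps) and the local slopes d log C / d log eps."""
    y = delay_embed(np.asarray(x, dtype=float), m, tau)
    c = np.array([correlation_sum(y, e) for e in eps_values])
    log_eps = np.log(eps_values)
    return log_eps, np.gradient(np.log(c), log_eps)
```

For a real seismogram the pairwise distance matrix grows as the square of the record length, so a long record would have to be subsampled or processed in blocks; the sketch ignores this.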
Figure 8. Slopes of correlation integrals (m = 10 - 25, $\tau$ = 2), P-wave of the
Indian nuclear explosion recorded by NIL station 30.05.98.

4  Using a neural network

In the last decade artificial neural networks (ANN) have become the most popular
tool for pattern recognition tasks 19,20 in general, and for distinguishing
seismic sources in particular 3,4,5. One can consider a neural network as a
distributed dynamical system consisting of a set of non-linear processing
elements (formal neurons) connected according to a certain architecture. The
formal neuron takes an input vector $x = \{x_j\}$, $j = 1, 2, \ldots, N$, and
computes its scalar product with the weights $w = \{w_j\}$, $j = 1, 2, \ldots, N$:

$$S = \sum_j x_j w_j \tag{7}$$

It then transforms $S$ in accordance with an activation function $\varphi$ into
the output of the neuron, $y = \varphi(S)$. Such a neural network can be trained
to recognize unknown patterns. The training process is based on adjusting the
connection weights between related values of inputs and targets (desirable
outputs) so as to minimize an error function 19. A set of input-target pairs is
called a training set. There exist two main types of ANN: fully-connected nets
and perceptrons 20.
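
A single formal neuron of the kind described above can be written in a few lines. This is a minimal sketch under stated assumptions: the text does not name the activation function, so `math.tanh` is used purely as a placeholder, and the function name is hypothetical.

```python
import math

def formal_neuron(x, w, activation=math.tanh):
    """Weighted sum S = sum_j x_j * w_j followed by an activation function."""
    if len(x) != len(w):
        raise ValueError("inputs and weights must have the same length")
    s = sum(xj * wj for xj, wj in zip(x, w))
    return activation(s)
```

Training then amounts to adjusting `w` so that the outputs for the training inputs approach their targets; that step is not shown here.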

In order to check the algorithm described in the previous section we trained
the fully-connected artificial neural network "MultiNeuron", developed by
Russian scientists 20, using 60 examples of the correlation integral slopes of
the seismic records mentioned above as input patterns. The net was tested on 10
seismograms of different underground nuclear explosions and earthquakes which
were not included in the training set. Testing showed 89% correct
classification of the signals.

5  Cross-correlation sums as a tool for distinguishing between different seismic sources

In this section we present the results of our study of non-linear cross
correlation sums, which are generalized correlation integrals and are used here
for estimating the similarity of two probability measures.

Let $\{y_i\}$, $\{z_i\}$, $i = 1, 2, \ldots, N$ denote two different observed time
series. The internal dynamics of the systems in a phase space $M$ which produce
these series through a typical mapping $M \to \mathbb{R}$ is unknown. However,
it is assumed that there exist chaotic or quasiperiodic attractors of these
systems whose dimensions $d_y$ and $d_z$ are low enough for the reconstruction
methods to be applied.

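Let $A_y$ and $A_z$ be their embeddings into $\mathbb{R}^n$, $n > 2(d_y + d_z)$.
Let $y$ and $z$ be the corresponding delay vectors, selected randomly according
to two different measures $\mu$ and $\rho$. The cross correlation integrals are
defined by 15:

$$C_{\mu\rho}(\varepsilon) = \int d\mu(y) \int d\rho(z)\, \Theta(\varepsilon - \|y - z\|) \tag{8}$$

The cross correlation sum is defined similarly by

$$C_{yz}(\varepsilon) = N^{-2} \sum_y \sum_z \Theta(\varepsilon - \|y - z\|) \tag{9}$$

Almost surely, $C_{yz}(\varepsilon) \to C_{\mu\rho}(\varepsilon)$ for $N \to \infty$,
and $C_{yz}(\varepsilon)$ is an unbiased estimator of $C_{\mu\rho}(\varepsilon)$.
For sufficiently small $\varepsilon$ and for absolutely continuous measures $\mu$
and $\rho$ one can show 15

$$C_{\mu\rho}^2(\varepsilon) \le C_{\mu\mu}(\varepsilon)\, C_{\rho\rho}(\varepsilon) \tag{10}$$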

In practice, this inequality is used to calculate the cross correlation ratio
as a similarity measure

$$r(\varepsilon) = \frac{C_{yz}(\varepsilon)}{\sqrt{C_{yy}(\varepsilon)\, C_{zz}(\varepsilon)}} \tag{11}$$

We used it in the form

$$K_{yz}(\varepsilon) = \sqrt{\frac{\sum_{i \neq j} \|z(i) - z(j)\|^2\, \Theta(\varepsilon - \|y(i) - y(j)\|)}{\sum_{i \neq j} \Theta(\varepsilon - \|y(i) - y(j)\|)}} \tag{12}$$


If $\{y_i\}$ and $\{z_i\}$ are related by identical dynamical scenarios, then one
can expect that $\|y(i) - y(j)\| < \varepsilon \Rightarrow \|z(i) - z(j)\| \approx \varepsilon$.
If not, $\|z(i) - z(j)\| \approx \Delta_z$, where $\Delta_z$ is the average
distance between points of the attractor $A_z$, and $K_{yz}$ does not depend on
$\varepsilon$.
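
The cross correlation sum of Eq. (9) and the ratio of Eq. (11) can be sketched as below. This is an illustrative sketch only: it assumes that the delay vectors are already available as the rows of two arrays, that the sup norm is used, and that all pairs (including identical indices in the auto-sums) enter the double sum, as the $N^{-2}$ normalization suggests. The function names are hypothetical.

```python
import numpy as np

def sup_distances(a, b):
    """Matrix of sup-norm distances between the rows of a and the rows of b."""
    return np.max(np.abs(a[:, None, :] - b[None, :, :]), axis=2)

def cross_sum(y, z, eps):
    """Fraction of pairs (y, z) of delay vectors closer than eps."""
    return np.mean(sup_distances(y, z) < eps)

def cross_ratio(y, z, eps):
    """Cross correlation ratio r(eps) = C_yz / sqrt(C_yy * C_zz)."""
    c_yy = cross_sum(y, y, eps)
    c_zz = cross_sum(z, z, eps)
    return cross_sum(y, z, eps) / np.sqrt(c_yy * c_zz)
```

With this normalization the ratio of a reconstruction with itself is exactly 1, and it drops to 0 when no point of one reconstruction lies within `eps` of the other.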


Fig. 9 shows the cross correlations of various seismic events. There is no
cross correlation between the Tibetan earthquake (09.01.90) and the Indian
nuclear explosion (11.05.98): the $K_{yz}(\varepsilon)$ curve is practically
parallel to the horizontal axis.
Figure 9. Cross correlation between reconstructions of the Tibetan earthquake
and the Indian nuclear explosion.

Figure 10. Cross correlation between reconstructions of the Indian and the
Pakistani nuclear explosions.


Figure 10 demonstrates a strong cross correlation between the Indian explosion
(11.05.98) and the Pakistani nuclear explosion (28.05.98). In this case we can
conclude that the dynamical scenarios of the two observed systems are identical.


6  Conclusion 

We  have  introduced  non-linear  criteria  for  distinguishing  between  underground  nu¬ 
clear  explosions  and  earthquakes  recorded  at  regional  distances  using  the  fractal 
approach. 

We noted that the behavior of the Hurst exponents varies with scale for graphs
of different seismograms: the regularity range for the earthquakes is larger
than for the explosions.

We found that embeddings of non-camouflaged explosions into $\mathbb{R}^2$
visually differ from those of earthquakes. The slopes of correlation integrals
demonstrate complexity of scaling. A notable plateau is absent for both types
of seismic sources, but the curves of the slopes are different for explosions
and earthquakes.

The correlation dimension curves of 60 seismograms of different seismic events
were entered into the neuroimitator "MultiNeuron", which was used as a
classifier of underground nuclear explosions and earthquakes. The testing
results have shown 89% correct classification of 10 seismic events which were
not included in the training set.

Sometimes it is possible to identify the seismic source by estimating its cross
correlation with a test example. Analysis of the relationships of attractors in
the phase space provides additional information for the discrimination task.

Therefore, our experience has shown that the non-linear methods could be
successfully applied to regional seismic event discrimination.
We verify the hypothesis of the multifractal nature of the radioactive
contamination caused by nuclear explosions at the Semipalatinsk test site (STS)
in Kazakhstan. The fields of terrestrial contamination have extremely high
variability caused by a number of natural processes. The methods of the
multifractal formalism and computational topology are used. The results of the
analysis support the existence of multifractal scaling, which is different for
natural and man-made isotopes. It means that there are "hot spots" of man-made
Cs isotope contamination, dangerous for human health, which cannot be detected
by the airborne measurement techniques traditionally used in Kazakhstan.
Moreover, the finding of self-similarity could be the basis of new methods of
diagnostics of large territories.


1  Introduction 


There were some 470 nuclear explosions at the STS in Kazakhstan, of which 90
were in the air, 25 on the ground and 355 underground. As a result, large
territories were contaminated, and this has become a matter of serious concern
for people with regard to its consequences for their lives and health 1. We
analyze the resulting contaminated areas by examining the distribution of
radionuclides. The analysis of the problem is not trivial and is influenced by
a number of issues, such as the wide range of contamination scales, extending
from meters to hundreds of kilometers; the presence of a mixture of radioactive
isotopes, due to numerous nuclear explosions of various composition and power;
the highly variable rate of migration of radionuclides in soils; and the
influence of climatic conditions on the localisation of contamination and its
transport 2. So, the radioactive contamination of the ground results from a
nonlinear interplay of geophysical fields which intervene over a large range of
scales. Consequently, the observed fields of contamination have an extreme
spatial variability, frequently described as "hot spots" or "leopard's skin" 3.

The complex structure of the field is the main difficulty of its diagnostics.
The absence of smoothness of the field restricts the choice of methods of
measurement and data processing. The maps of contamination are designed using
interpolation of measurements and, consequently, depend on the scale of
averaging. As a result, sparse anomalies can be missed.

To detect all fluctuations of the contamination field, measurements must cover
a territory by a network of overlapping neighborhoods 4. But this is impossible
for the huge areas of the former nuclear weapon testing site, spanning an area
of around 18500 km$^2$, and the regions bordering this test site which we deal
with. At present, only 1% of the contaminated territory has been investigated
by the airborne measurement methods accepted in Kazakhstan, which is definitely
not enough to describe the radioactive contamination and to implement effective
decontamination methods. The scale-symmetry assumption is the simplest, and
also the only acceptable one, in the absence of knowledge of the specific
mechanisms producing these fields. The idea of scaling invariance can be useful
in practice and could become a basis for the analysis and diagnostics of the
areas.

The present study deals with the area analysis of the distribution of
radionuclides in Kazakhstan. The data are obtained primarily from the former
STS and, secondly, from regions bordering this test site. The experimental data
consist of radionuclide measurements made from the air, on a grid of parallel
lines covering a part of the territory. Two regions, Karaganda and
Semipalatinsk, and the Irtysh test site were explored.

It is possible to analyze these measurements by several complementary methods,
among them the fractal 5, multifractal 3,6 and morphology methods 7. Here, we
present the multifractal method 8,9 and the method of computational topology 10.

The  structure  of  this  paper  is  as  follows.  In  Sec.  2  we  recall  some  definitions 
of  multifractal  analysis  and  apply  this  technique  to  the  data  from  flight  paths.  In 
Sec.  3  we  describe  the  method  of  computational  topology  and  present  the  results 
of  applying  this  method  and  multifractal  one  to  data  from  the  Irtysh  test  site.  The 
summary  is  found  in  the  concluding  section. 

2  Multifractal  analysis  of  radioactive  contamination 

We used an approach in which the contamination data along a path are considered
as multifractal random measures. Let us recall 8,9,11 that the multifractal
spectrum of singularities $\alpha$ of a finite Borel measure $\mu$ on a compact
set $X$ is a function $f(\alpha)$ defined by a pair $(g, G)$. Here,
$g : X \to [-\infty, \infty]$ is a function which determines the level sets:

$$K^g_\alpha = \{x \in X : g(x) = \alpha\} \tag{1}$$

and produces a multifractal decomposition of $X$:

$$X = \bigcup_{-\infty < \alpha < \infty} K^g_\alpha. \tag{2}$$

Let $G$ be a real function defined on subsets $Z_i \subset X$ such that
$G(Z_1) \le G(Z_2)$ if $Z_1 \subset Z_2$. Then the multifractal spectrum is
$f(\alpha) = G(K^g_\alpha)$. Let $g$ be the pointwise dimension $d_\mu$ of the
measure $\mu$ at all points $x \in X$ for which the limit

$$g = d_\mu(x) = \lim_{r \to 0} \frac{\log \mu(B(x, r))}{\log r} \tag{3}$$

exists, where $\mu(B(x, r))$ is the "mass" of the measure in the ball of radius
$r$ centered at $x$. Since we have chosen $g = d_\mu$, we can drop the index $g$
from further references to $K^g_\alpha$. Then
$K_\alpha = \{x : d_\mu(x) = \alpha\}$, where the exponent $\alpha$ is the local
density of $\mu$. The singular distribution $\mu$ can then be characterized by
giving the "size" of the sets $K_\alpha$ by their Hausdorff dimension, i.e.
$f(\alpha) = G(K_\alpha) = \dim_H(K_\alpha)$. If $\mu$ is self-similar in some
sense, $f(\alpha)$ is a well-behaved concave function of $\alpha$ 9. To estimate
$f(\alpha)$ for path data we applied the method of the partition sum 11.

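Let $X = [0, 1]$ and let $k_n$ be an increasing sequence of positive integers.
Define:

$$I_{i,n} = \left[\frac{i}{k_n}, \frac{i+1}{k_n}\right] \tag{4}$$

The component of the multifractal decomposition of $X$ is

$$K_\alpha = \left\{x \in X \;\middle|\; \lim_{n \to \infty} \frac{\log \mu(I_{i,n}(x))}{\log k_n^{-1}} = \alpha\right\}. \tag{5}$$

It is easier to analyze the "coarse grained" Hölder exponents 12
$\alpha = \alpha_n$ of $\mu$ on $I_{i,n}$:

$$\alpha_n = \frac{\log \mu(I_{i,n})}{\log k_n^{-1}} \tag{6}$$

Their variations are described by the Legendre spectrum 13:

$$f(\alpha) = f_L(\alpha) = \inf_q \bigl(\alpha q - \tau(q)\bigr), \tag{7}$$

where

$$\tau(q) = \lim_{n \to \infty} \frac{1}{\log k_n^{-1}} \log \sum_i \left[\mu(I_{i,n})\right]^q \tag{8}$$

and the summation runs through $i$ such that $\mu(I_{i,n}) \neq 0$.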
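
The partition-sum estimate of $\tau(q)$ and the Legendre spectrum can be illustrated on a synthetic measure. The sketch below is not the authors' procedure: it assumes a single resolution $k_n$ equal to the number of cells (no limit or regression over $n$), the sign convention $\log k_n^{-1}$ in the denominator, and the parametric form $\alpha(q) = d\tau/dq$, $f = q\alpha - \tau$ of the infimum in Eq. (7). The binomial cascade is only a hypothetical test measure, chosen because its $\tau(q)$ is known in closed form.

```python
import numpy as np

def binomial_measure(p, levels):
    """Masses of a binomial cascade on 2**levels equal cells of [0, 1]."""
    mu = np.array([1.0])
    for _ in range(levels):
        # every cell is split into a left part (weight p) and a right part (1 - p)
        mu = np.column_stack([p * mu, (1.0 - p) * mu]).ravel()
    return mu

def tau(mu, q):
    """Finite-resolution tau(q) = log(sum_i mu_i**q) / log(1/k), k = number of cells."""
    k = len(mu)
    nonzero = mu[mu > 0]  # the sum runs over cells carrying measure
    return np.log(np.sum(nonzero ** q)) / np.log(1.0 / k)

def legendre_spectrum(mu, qs):
    """Parametric Legendre transform: alpha(q) = d tau / d q, f = q * alpha - tau."""
    t = np.array([tau(mu, q) for q in qs])
    alpha = np.gradient(t, qs)
    return alpha, qs * alpha - t
```

For example, `legendre_spectrum(binomial_measure(0.3, 10), np.linspace(-6, 4, 101))` traces a concave curve whose maximum, reached at $q = 0$, equals the dimension of the support.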

We analyze the radionuclide contamination data measured from the air for the
Karaganda and Semipalatinsk regions. The height of all flights was about 50 m
and the velocity of the plane was about 125 km/h. Spectrometric measurements
were usually made at time intervals of 1 s. The technique used is based on
measurements of the $\gamma$-quanta flow density of $^{214}$Bi (1.12 and
1.76 MeV) to determine the contamination by U, $^{208}$Tl (2.62 MeV) to
determine Th contamination, $^{40}$K (1.46 MeV) to determine K and $^{137}$Cs
(0.66 MeV) for Cs contamination. Total $\gamma$-activity is measured in the
range of 0.25-3.0 MeV. The spectrometer was equipped with a NaI element. The
measurements have been done at different scales, e.g., in the Karaganda and
Semipalatinsk regions the scale is 1:1,000,000, so the distance between paths
is 10 km. There are up to 11000 data points along the paths, with each reading
separated by 50 meters. A sample of measured data can be found in Fig. 1.

For calculation of the multifractal spectrum it is necessary to transform the
contamination values $Y$ into probabilistic measures. As a rule, the empirical
probability density function $p(Y)$ behaves like curve (1) shown in Fig. 2,
with decaying tails. The path data $Y$ were transformed to new variables $Z$
which have a rectangular (uniform) probability density function (curve 2 in
Fig. 2) by solution of the equation $P(Y) = P(Z)$, where $P$ is a probability
function. In the case of the rectangular distribution $Z = P(Y)$. This
transformation is equivalent to a coordinate change which preserves the
dimension spectrum $f(\alpha)$ 14.

Figure 1. Data array of Cs, Semipalatinsk region path.

Figure 2. The histograms of the distribution of contamination data (1) and
probabilistic measures (2).
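
The transformation $Z = P(Y)$ can be sketched with the empirical distribution function. This is an assumption about the implementation, which the text does not spell out: here $P$ is estimated from the ranks of the data, and ties are broken by order of appearance. The function name is hypothetical.

```python
import numpy as np

def to_uniform(y):
    """Map data to Z = P(Y), with P the empirical distribution function (ranks / N)."""
    y = np.asarray(y, dtype=float)
    order = np.argsort(y, kind="stable")
    ranks = np.argsort(order, kind="stable")  # rank of each sample, starting at 0
    return (ranks + 1) / len(y)
```

The resulting values are spread evenly over (0, 1] and keep the ordering of the original readings, which is what makes the map a monotone change of coordinate.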

The range of $q$ values was chosen as $[-6, 4]$. Fig. 3 displays the set of
multifractal spectra of radionuclide contamination of paths. The scaling of all
isotopes obeys a multifractal law. The behaviors of the $f(\alpha)$ curves are
different for various isotopes. Fig. 4 shows the set of $f(\alpha)$ spectra of
the man-made isotope Cs data calculated for different paths. It can be seen
that scaling properties differ from path to path, but this scaling variability
is not so high as distinction of $f(\alpha)$

a distance between adjacent paths is ≈ 10 km and the effective strip of
equipment capture is ~ 300 m. For the case of measurements paving the territory
we can use the method of computational topology described in the next section,
which complements the multifractal method.


3  Computational topology of the radionuclide fields: disconnectedness index


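The multifractal formalism can be applied to area contaminations, where the
support of the measure $\mu(x, y)$ is a compact set
$X = [0, 1] \times [0, 1] \subset \mathbb{R}^2$. In this case 13
$I_{i,n} \to I_{i,j,n}$, where

$$I_{i,j,n} = \left[\frac{i}{k_n}, \frac{i+1}{k_n}\right] \times \left[\frac{j}{k_n}, \frac{j+1}{k_n}\right] \tag{9}$$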

Unfortunately, the contamination data do not allow the use of a square paving,
as the distance between the data points on adjacent paths is always larger than
the distance between neighbouring points along a single path. One could use a
tessellation of the surface by rectangles $\{c_i\}$ 15. In this case the
probability measure on $c_i$ is averaged over the rectangle, so that
$\mu(x, y)$, $x, y \in c_i$, is a step-like function which has a non-zero
constant value on each rectangle. The components of the "coarse-graining"
multifractal decomposition $K_\alpha$ are found by selecting the rectangles
$c_i$ with Hölder exponent $\alpha \pm \Delta\alpha$, where $\Delta\alpha$ is the
discrete step of the measure. The coarse Hölder exponent is

$$\alpha(c_i) = \frac{\log \mu(c_i)}{\log |c|}, \tag{10}$$

where $|c|$ is the size of $c_i$. The "coarse-graining" version of $f(\alpha)$ is

$$f_G(\alpha) = \lim_{\Delta\alpha \to 0} \lim_{\delta \to |c|} \frac{\log N_\delta(\alpha, \Delta\alpha)}{\log(1/\delta)}, \tag{11}$$

where $N_\delta(\alpha, \Delta\alpha)$ denotes the number of boxes of size
$\delta$ with $\alpha(c_i) \approx \alpha$. This function can be explained in
statistical terms 12 as the probability $p_\delta$ of finding
$\alpha(c_i) \approx \alpha$, which behaves roughly like

$$N_\delta(\alpha)/N_\delta = p_\delta\bigl(\alpha(c_i) \approx \alpha\bigr) \sim \delta^{D - f_G(\alpha)}. \tag{12}$$

Here, $N_\delta$ is the total number of non-empty $\delta$-boxes which contain
measure, and $D$ denotes the box dimension of the support of $\mu$.

To estimate the $f_G(\alpha)$ spectrum one could use functional box-counting 15;
however, the covering by rectangular neighborhoods makes it impossible to
re-scale the rectangle edges uniformly and introduces additional local and
global dimensions which have no clear meaning 16. Consequently, we use
computational topology for calculation of the box dimension.

The component $K_\alpha$ of the multifractal decomposition is a subset of
$\{c_i\}$ having the same $\alpha \pm \Delta\alpha$. The "coarse-graining"
geometry of $K_\alpha$ can be described by the rate of growth of the number of
$c_i \in K_\alpha$ with improving resolution 10, which is called the
disconnectedness index $\gamma$.


Figure  5.  The  paving  map  of  U  contamination  of  Irtysh  test  site. 


A topological space $X$ is connected if and only if it cannot be decomposed
into the union of two non-empty, disjoint, closed sets. If such a decomposition
exists then $X$ is said to be disconnected, that is, if there are two non-empty
closed sets $U$ and $V$ such that $U \cap V = \emptyset$ and $U \cup V = X$. The
notion of connectedness was reformulated in order to make it possible to
implement a test for this property on a computer. The basic idea is to look at
the set with a finite resolution $\varepsilon$ and see how connectedness changes
as we let $\varepsilon \to 0$. A subset $X$ of a metric space is said to be
$\varepsilon$-disconnected if it can be decomposed into sets that are separated
by a distance of at least $\varepsilon$.

Let $N_\varepsilon(\alpha)$ be the number of $\varepsilon$-components at a given
resolution $\varepsilon$ and measure intensity $\alpha \pm \Delta\alpha$. The
disconnectedness index $\gamma$ is the following limit:

$$\gamma = \liminf_{\varepsilon \to 0} \frac{\log N_\varepsilon(\alpha)}{\log(1/\varepsilon)}. \tag{13}$$

The index equals the Hausdorff dimension for the simplest sets, for example for
the middle-thirds Cantor set. However, more generally the disconnectedness
index distinguishes between sets with the same box-counting dimension.
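
The Cantor-set statement can be checked with a small one-dimensional sketch. It is only an illustration under stated assumptions: the set is approximated by the endpoints of a finite-depth construction, coordinates are kept as integers in units of $3^{-\text{depth}}$ to avoid rounding, and two points belong to the same $\varepsilon$-component when the gap between them is smaller than $\varepsilon$. The function names are hypothetical.

```python
import math

def cantor_points(depth):
    """Endpoints of the depth-level middle-thirds Cantor construction,
    as integers in units of 3**-depth."""
    intervals = [(0, 3 ** depth)]
    for _ in range(depth):
        nxt = []
        for a, b in intervals:
            third = (b - a) // 3
            nxt += [(a, a + third), (b - third, b)]  # drop the middle third
        intervals = nxt
    return sorted({p for ab in intervals for p in ab})

def count_components(sorted_points, eps):
    """Number of eps-components: clusters separated by gaps of at least eps."""
    gaps = (b - a for a, b in zip(sorted_points, sorted_points[1:]))
    return 1 + sum(g >= eps for g in gaps)

def disconnectedness_index(depth=8):
    """Ratio log N_eps / log(1/eps) at the finest usable resolution eps = 3**-(depth-1)."""
    pts = cantor_points(depth)
    k = depth - 1
    n_components = count_components(pts, 3 ** (depth - k))
    return math.log(n_components) / (k * math.log(3))
```

At resolution $\varepsilon = 3^{-k}$ the construction has $2^k$ components, so the ratio equals $\log 2 / \log 3 \approx 0.63$, the Hausdorff dimension of the Cantor set.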


3.1  Scaling properties of Irtysh area contamination

Both the multifractal formalism and computational topology were applied to the
Th, K, U and Cs isotope contamination of the Irtysh area, which is a part of
the STS. The Irtysh area was chosen for calibration of the devices for airborne
spectrometric measurements and was investigated more thoroughly. The distance
between paths was 50 m, the effective strip of equipment capture was ~ 50 m,
and there were 1197 measurements on the ground.

To calculate the $\gamma$ index, the range of values for each isotope was
divided into 255 levels. The contamination values $\nu(c_i)$ are expressed as a
share of the whole range, which was normalized to 1. Fig. 5 shows a part of the
digital image of contamination used for the $\gamma$ computation.

The curves of the disconnectedness index (Fig. 7) of the natural isotopes have
plateaux with $\gamma = 2$ at $\nu \in [0.3, 0.5]$ for Th, $\nu \in [0.3, 0.45]$
for K and $\nu \in [0.3, 0.35]$ for U, and the $\gamma$ values decrease as $\nu$
increases. The behavior of the man-made Cs isotope is different. The index of
disconnectedness $\gamma$ is equal to 1.3 at small $\nu$ and slowly decreases,
demonstrating two plateaux: $\gamma = 1$ at $\nu \in [0.45, 0.7]$ and
$\gamma \approx 0.6$ at $\nu > 0.8$ (438 mCi/km$^2$). The latter indicates the
possible existence of "hot spots" of Cs contamination, which lie in the area of
highest concentrations (> 438 mCi/km$^2$) and small fractal dimensions
$\gamma < 0.6$. In general, $\gamma(\nu)$ can be interpreted as a
"pre-multifractal" spectrum $f_G(\alpha)$. In this case, the probability of
existence of "hot spots" decreases with the spot size $\delta$ as a power law
(cf. Eq. 12). It is worth noting the similar behavior of Cs cumulative soil
deposition data in Czechoslovakia after the Chernobyl accident 6.

Statistical properties of multifractal fields are a function both of the scale
and of the dimension of the support of the measurements. It is known that a
measurement net with empirical box dimension $D_{net}$ cannot detect sparsely
distributed phenomena with box dimension $D_a < 2 - D_{net}$ 17. This conclusion
follows from the common ideas about transversality 18. Recall that two sets $M$
and $N$ of dimensions $D_M$ and $D_N$ intersect transversally in $\mathbb{R}^m$
if

$$\operatorname{codim}(M \cap N) = \operatorname{codim} M + \operatorname{codim} N. \tag{14}$$

So the net with dimension $D_{net}$ can detect a set of dimension $D_a$ only if
$D_{net} + D_a > m$. As a rule, the box dimension of the net of airborne
spectrometric measurements is $D_{net} < 2$; consequently, a number of anomalies
on contaminated territories can be missed.
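
As a worked instance of Eq. (14), assume that the measurements lie in the plane ($m = 2$) and that the codimension of a fractal set can be identified, informally, with $m$ minus its box dimension. Under these assumptions:

```latex
\begin{aligned}
\operatorname{codim} M &= 2 - D_{net}, \qquad \operatorname{codim} N = 2 - D_a, \\
\operatorname{codim}(M \cap N) &= 4 - D_{net} - D_a, \\
\dim(M \cap N) &= 2 - \operatorname{codim}(M \cap N) = D_{net} + D_a - 2 .
\end{aligned}
```

The intersection has negative dimension, i.e. it is generically empty, exactly when $D_a < 2 - D_{net}$, which is the detection limit quoted above.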

In addition to the computational topology method, the multifractal formalism
was used to estimate the $f(\alpha)$ spectra of the Irtysh test site data. A
one-dimensional array was constructed by linking all paths into one. Adjacent
paths were "glued" together by taking into account their orientation and adding
the beginning of the following path to the end of the previous one. Such arrays
were formed for all isotopes and multifractal spectra were computed. The
results of this analysis are shown in Fig. 6. The comparison of the results of
both methods applied to the Irtysh data points to good agreement in the
discrimination of isotopes by their scaling. The qualitative behaviour of the
$f(\alpha)$ and $\gamma(\nu)$ (Fig. 7) curves is the same. The man-made isotope
Cs demonstrated the largest Hölder exponent $\alpha \approx 1.7$ in the
multifractal spectra. It means that there are hot spots of Cs contamination
which form a sparse set having dimension $f(\alpha) < 0.2$.

4  Conclusion 

All the results of the multifractal analysis that we obtained support the
multifractal nature of the terrestrial radionuclide contamination in the two
investigated regions and the Irtysh area. The analysis made it possible to
distinguish the radionuclide isotopes K, Th, U and Cs, especially for
measurements made at the Irtysh test site.

Using the method of computational topology for the case of measurements paving
the territory, a pre-multifractal spectrum of dimensions (indexes of
disconnectedness) for the Irtysh area was calculated. It was found that the
behavior of the disconnectedness index of natural isotopes differs strongly
from that of Cs. Contamination by natural radionuclides exhibits
$\gamma \approx 2$ for low values, and the dimension decreases quickly with
increasing level. The spectrum of dimensions for Cs points to the existence of
"hot spots" of contamination.

Taking into account the multifractal properties of man-made Cs found for the
Irtysh area, one can point to the existence of anomalies dangerous for human
health on all contaminated territories. These "hot spots" cannot be detected by
the airborne spectrometric measurements accepted in Kazakhstan. The results
obtained can be the basis of new methods of diagnostics of large territories.
PATTERN SELECTION: NONSINGULAR SAFFMAN-TAYLOR
FINGER AND ITS DYNAMIC EVOLUTION WITH ZERO
SURFACE TENSION
By a slight modification of Saffman-Taylor's viscous finger, we remove its
singularity and present a finite-length nonsingular Saffman-Taylor finger. Its
dynamic evolution solutions, both the one-step theory prediction and the
numerical simulation in the long-time limit with zero surface tension, give an
alternative proof of the analytical demonstration of the selection of the
Saffman-Taylor finger width in the absence of surface tension. As a simple
example of pattern selection which has a real experimental background, this
work not only contradicts the generally accepted belief that surface tension is
indispensable for the selection of the $\frac12$-width finger but also provides
more models to compute numerically the evolution, competition and ramification
of multiple fingers in a straight channel as well as in circular disc geometry.


1  Introduction 

The Saffman-Taylor problem 1 has played a central role in the study of viscous
fingering in a Hele-Shaw cell 2, which is modeled by two-dimensional potential
flow at the interface between two fluids. Saffman and Taylor found analytically
a continuous family of steady-state solutions, which shows that fingers of
different width could exist in the absence of surface tension, but their
experiments with negligible surface-tension effect in 1958 and numerical
calculations made by McLean, Saffman and Vanden-Broeck in the presence of
surface-tension effect in 1981 and 1983 showed no hint of the continuous
family 3,4. On the contrary, the finger with relative width $\lambda = 0.5$ is
always singled out in the zero surface tension limit. This problem (the
so-called Saffman-Taylor problem) has been of much subsequent interest because
it is universal, i.e., the same selection phenomenon is common to the
displacement of various viscous liquids by less viscous ones for immiscible
incompressible liquids. Much work 5 was done toward solving this Saffman-Taylor
puzzle. However, it has been widely accepted that the inclusion of surface
tension is the only way to select the most stable finger width, since Saffman
and Taylor proposed that surface tension between the two fluids would solve the
selection problem 1. Although several works 6 gave numerical evidence of a
discrete spectrum of $\lambda$, decreasing to $\frac12$ in the limit of low
surface tension, exact solutions with zero surface tension are rare. Recently,
Mineev-Weinstein 7 analytically solved the finger selection problem in the
absence of surface tension. He showed the exact result that a generic interface
in a Hele-Shaw cell evolves to a nonlinearly stable, single, uniformly advancing
finger occupying one half of the channel width, which contradicts the generally
accepted belief that surface tension is indispensable for the selection of the
$\frac12$-width finger.

Figure 1.

We find that the problem of the finger width selection stems from the
singularity of the Saffman-Taylor finger, which is half infinite in length. The
singularity makes the numerical calculation impossible and is not in agreement
with the experimental results. We introduce a positive variable
$\varepsilon(t)$ to remove the singularity. By the conformal mapping method
presented by Bensimon 8, a one-step analysis predicts that there exists a
unique steady width $\lambda = 0.5$ of the viscous finger, which fingers of
other widths approach in the absence of surface tension. Then we verify this
prediction by numerical computation and show its long-time evolution.


2  Nonsingular Saffman-Taylor Finger and One-step Dynamic Evolution Theory

Following Saffman and Taylor we consider a long air bubble or finger moving
through a channel in the Hele-Shaw cell bounded by two straight parallel walls.
We take the y-axis (Fig. 1) perpendicular to the walls and the x-axis along the
central axis of symmetry of the channel. The separation of the walls is
normalized to $2\pi$ and the velocity of the fluid at infinity in front of the
finger is normalized to unity. Let $z = x + iy$ and $\Phi = \phi + i\psi$, with
$\phi(z)$ and $\psi(z)$ being the velocity potential and stream function,
respectively.

In  case  of  the  absence  of  surface  tension,  the  interface  is  described  by  0  =  0, 
which  can  be  conformally  mapped  to  the  circumference  of  a  unit  disk  in  the  complex 
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plane  C  =  e~*.  By  these  notations  the  Saffman-Taylor  solution  1  can  be  written  as 

2  =  2(1-  A )Jni  (e-*+  +  1)  +  iif>,  (1) 

where  the  single  parameter  A  depicts  the  width  of  the  finger.  Saffman-Taylor  finger 
is  half  infinite  in  length  and  the  solution  is  steady  in  the  frame  co-moving  with  the 
finger  tip  which  has  a  speed  \  relative  to  the  laboratory  frame. 

In order to remove the singularity at $\psi = \pm\pi$ in Eq. (1), we choose the solution in the following form in the laboratory frame:

$$z = 2(1-\lambda)\ln\tfrac{1}{2}(\zeta + 1 + \varepsilon) - \ln\zeta + \frac{t}{\lambda} + \tilde z(\zeta, t), \tag{2}$$

where $\varepsilon = \varepsilon(t)$ ($= \varepsilon_0$ at $t = 0$) is a small positive real number varying with time $t$, and $\tilde z(\zeta, t)$ ($= 0$ at $t = 0$) describes the possible additional time dependence of the finger. The functional forms of $\varepsilon(t)$ and $\tilde z(\zeta, t)$ are to be determined by the evolution equation of the interface. In Eq. (2) the singularity of the Saffman-Taylor solution at $\zeta = -1$ on the unit circle is moved outside the unit disk, and therefore the length of the finger becomes finite (Fig. 1).

The corresponding equation of the boundary of the nonsingular Saffman-Taylor finger is

$$z = 2(1-\lambda)\ln\tfrac{1}{2}\left(e^{-i\psi} + 1 + \varepsilon\right) + i\psi + \frac{t}{\lambda} + \tilde z(\psi, t). \tag{3}$$

At time $t$, the finger tip is located at $\psi = 0$, $\phi = 0$ and the rear end at $\psi = \pm\pi$, $\phi = 0$; therefore the length of the finger is

$$L(t) = x_0 - x_1 = 2(1-\lambda)\ln\left(1 + \frac{2}{\varepsilon}\right). \tag{4}$$
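As a quick check of the finger length, one can evaluate the real part of the boundary map at the tip and at the rear end. The short derivation below is only a sketch: it assumes the boundary map reads $z = 2(1-\lambda)\ln\tfrac{1}{2}(e^{-i\psi}+1+\varepsilon) + i\psi + t/\lambda + \tilde z$ and that the contribution of $\tilde z$ to the length is neglected.

```latex
\begin{aligned}
x_0 &= \operatorname{Re} z\big|_{\psi=0}
     = 2(1-\lambda)\ln\frac{2+\varepsilon}{2} + \frac{t}{\lambda},\\
x_1 &= \operatorname{Re} z\big|_{\psi=\pm\pi}
     = 2(1-\lambda)\ln\frac{\varepsilon}{2} + \frac{t}{\lambda},\\
L(t) &= x_0 - x_1
     = 2(1-\lambda)\ln\frac{2+\varepsilon}{\varepsilon}
     = 2(1-\lambda)\ln\Bigl(1+\frac{2}{\varepsilon}\Bigr).
\end{aligned}
```

Under these assumptions the length diverges as $\varepsilon \to 0$, which recovers the semi-infinite Saffman-Taylor finger, while any $\varepsilon > 0$ gives a finite finger.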


In our notation, Bensimon's equation [8] in the absence of surface tension is written as

$$\frac{\partial z}{\partial t} = -\zeta\frac{\partial z}{\partial \zeta}\,G(\zeta), \tag{5}$$

where

$$G(\zeta) = A\{g(\psi)\}, \qquad g(\psi) = \left[\,\left|\zeta\frac{\partial z}{\partial \zeta}\right|^{-2}\right]_{\Gamma}, \tag{6}$$

and the suffix $\Gamma$ means that the value of the function in the square brackets is to be taken on the unit circle, where $\zeta = e^{-i\psi}$. $A$ is an operator that analytically continues the real function $g(\psi)$, defined on the circumference of the unit circle, to a complex function $G(\zeta)$ in its interior whose real part on the circle is $g(\psi)$. The analytic continuation is performed by Fourier expansion.

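When $t = 0$, Eq. (2) becomes

$$z = 2(1-\lambda)\ln\tfrac{1}{2}(\zeta + 1 + \varepsilon_0) - \ln\zeta, \tag{7.1}$$

and the equation of the nonsingular Saffman-Taylor finger is

$$z = 2(1-\lambda)\ln\tfrac{1}{2}\left(e^{-i\psi} + 1 + \varepsilon_0\right) + i\psi. \tag{7.2}$$

From Eq. (6) we then have

$$g(\psi) = \frac{1 + 2(1+\varepsilon_0)\cos\psi + (1+\varepsilon_0)^2}{(1-2\lambda)^2 - 2(1-2\lambda)(1+\varepsilon_0)\cos\psi + (1+\varepsilon_0)^2}. \tag{8}$$

Taking a trial solution of the form

$$G(\zeta) = \frac{a + b\zeta}{(1-2\lambda)\zeta - (1+\varepsilon_0)}, \tag{9}$$

with $a$, $b$ to be determined, we have

$$\mathrm{Re}\,G(\zeta)\big|_{\Gamma} = \frac{(b + a\cos\psi)(1-2\lambda) - b(1+\varepsilon_0)\cos\psi - a(1+\varepsilon_0)}{(1-2\lambda)^2 - 2(1-2\lambda)(1+\varepsilon_0)\cos\psi + (1+\varepsilon_0)^2}. \tag{10}$$

Equating

$$\mathrm{Re}\,G(\zeta)\big|_{\Gamma} = g(\psi), \tag{11}$$

we get

$$b = \frac{(1-2\lambda) + (1-2\lambda)(1+\varepsilon_0)^2 + 2(1+\varepsilon_0)^2}{(1-2\lambda)^2 - (1+\varepsilon_0)^2}, \tag{12}$$

$$a = \frac{\left(2(1-2\lambda) + (1+\varepsilon_0)^2 + 1\right)(1+\varepsilon_0)}{(1-2\lambda)^2 - (1+\varepsilon_0)^2}. \tag{13}$$

Substituting Eqs. (12) and (13) into $G(\zeta)$, we get

$$-\zeta\frac{\partial z}{\partial \zeta}G(\zeta) = \frac{1}{\lambda} - \frac{(1-\lambda)(1+\varepsilon_0)\left[1-(1+\varepsilon_0)^2\right]}{\lambda\left[(1-2\lambda)^2-(1+\varepsilon_0)^2\right](\zeta+1+\varepsilon_0)} - \frac{(1-2\lambda)(1-\lambda)\left[1-(1+\varepsilon_0)^2\right]\zeta}{\lambda\left[(1-2\lambda)^2-(1+\varepsilon_0)^2\right](\zeta+1+\varepsilon_0)}. \tag{14}$$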

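Meanwhile, from Eq. (2) at time $t$, we have

$$\frac{\partial z}{\partial t} = \frac{1}{\lambda} + \frac{2(1-\lambda)}{\zeta+1+\varepsilon}\frac{d\varepsilon}{dt} + \frac{\partial \tilde z}{\partial t}. \tag{15}$$

Comparing Eq. (14) and Eq. (15), we have

$$\frac{2(1-\lambda)}{\zeta+1+\varepsilon_0}\frac{d\varepsilon}{dt} = -\frac{(1-\lambda)(1+\varepsilon_0)\left[1-(1+\varepsilon_0)^2\right]}{\lambda\left[(1-2\lambda)^2-(1+\varepsilon_0)^2\right](\zeta+1+\varepsilon_0)}, \tag{16}$$

$$\frac{\partial \tilde z}{\partial t} = -\frac{(1-2\lambda)(1-\lambda)\left[1-(1+\varepsilon_0)^2\right]\zeta}{\lambda\left[(1-2\lambda)^2-(1+\varepsilon_0)^2\right](\zeta+1+\varepsilon_0)}, \tag{17}$$

and at the boundary

$$\frac{\partial \tilde z}{\partial t}\bigg|_{\Gamma} = -\frac{(1-2\lambda)(1-\lambda)\left[1-(1+\varepsilon_0)^2\right]\left\{\left[1+(1+\varepsilon_0)\cos\psi\right] - i(1+\varepsilon_0)\sin\psi\right\}}{\lambda\left[(1-2\lambda)^2-(1+\varepsilon_0)^2\right]\left[1+2(1+\varepsilon_0)\cos\psi+(1+\varepsilon_0)^2\right]}. \tag{18}$$

In the first time step we have

$$\Delta z = \frac{\partial \tilde z}{\partial t}\Delta t, \tag{19}$$

thus

$$\Delta x = \mathrm{Re}\,\frac{\partial \tilde z}{\partial t}\Delta t = -\frac{(1-2\lambda)(1-\lambda)\left[1-(1+\varepsilon_0)^2\right]\left[1+(1+\varepsilon_0)\cos\psi\right]}{\lambda\left[(1-2\lambda)^2-(1+\varepsilon_0)^2\right]\left[1+2(1+\varepsilon_0)\cos\psi+(1+\varepsilon_0)^2\right]}\Delta t, \tag{20}$$

$$\Delta y = \mathrm{Im}\,\frac{\partial \tilde z}{\partial t}\Delta t = \frac{(1-2\lambda)(1-\lambda)\left[1-(1+\varepsilon_0)^2\right](1+\varepsilon_0)\sin\psi}{\lambda\left[(1-2\lambda)^2-(1+\varepsilon_0)^2\right]\left[1+2(1+\varepsilon_0)\cos\psi+(1+\varepsilon_0)^2\right]}\Delta t. \tag{21}$$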

Choosing a suitable $\varepsilon_0$, we can see from Eqs. (20) and (21):

•  If originally $\lambda > 0.5$, then $\Delta x > 0$, $\Delta y < 0$: the finger is stretched to become longer and thinner at later times, i.e., $\lambda$ tends to decrease.

•  If originally $\lambda < 0.5$, then $\Delta x < 0$, $\Delta y > 0$: the finger is compressed to become shorter and wider at later times, i.e., $\lambda$ tends to increase.

•  If originally $\lambda = 0.5$, then $\Delta x = 0$, $\Delta y = 0$: the finger keeps its width forever.

Thus, in our one-step dynamic evolution theory of the nonsingular Saffman-Taylor finger, we conclude: there exists a unique steady width $\lambda = 0.5$ of the viscous finger, which fingers of other widths approach. This is exactly what Saffman and Taylor observed in their experiment in 1958.
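The sign pattern of the three cases above can be checked numerically. The following Python sketch evaluates the one-step displacement rates, i.e. the coefficients of $\Delta t$ in the expressions for $\Delta x$ and $\Delta y$, written out in the code. The function name `one_step_displacement` and the sample values $\varepsilon_0 = 0.3$, $\psi = \pi/2$ are illustrative assumptions, not part of the original analysis.

```python
import math

def one_step_displacement(lam, eps0, psi):
    """Return (dx/dt, dy/dt) for the first time step.

    dx/dt = -C * [1 + (1+eps0) cos(psi)],  dy/dt = C * (1+eps0) sin(psi),
    with C = (1-2*lam)(1-lam)[1-(1+eps0)^2]
             / (lam [(1-2*lam)^2-(1+eps0)^2] [1+2(1+eps0)cos(psi)+(1+eps0)^2]).
    """
    c = 1.0 - 2.0 * lam          # shorthand for (1 - 2*lambda)
    e = 1.0 + eps0               # shorthand for (1 + eps0)
    common = c * (1.0 - lam) * (1.0 - e * e) / (
        lam * (c * c - e * e) * (1.0 + 2.0 * e * math.cos(psi) + e * e)
    )
    dx = -common * (1.0 + e * math.cos(psi))
    dy = common * e * math.sin(psi)
    return dx, dy

# Illustrative sample point on the upper side of the finger (assumed values).
for lam in (0.3, 0.5, 0.7):
    dx, dy = one_step_displacement(lam, eps0=0.3, psi=math.pi / 2)
    print(f"lambda={lam}: dx/dt={dx:+.4f}, dy/dt={dy:+.4f}")
```

At this sample point the signs follow the three cases listed above; other values of $\psi$ and $\varepsilon_0$ can be tried in the same way.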

3  Numerical  Computation 

From the one-step analysis described above we cannot completely predict the change of shape in the long run; however, the evolution can be obtained by numerical computation. At time $t$, if $z(\psi, t) = x(\psi, t) + iy(\psi, t)$ is given, then, since $\zeta = e^{-i\psi}$, we have

$$\left[\zeta\,\partial_\zeta z\right]_{\Gamma} = i\,\partial_\psi z. \tag{22}$$

Substitution of Eq. (22) into Eq. (6) yields

$$g(\psi) = \frac{1}{(\partial_\psi y)^2 + (\partial_\psi x)^2}. \tag{23}$$

According to the Poisson integral formula, if

$$g(\psi) = a_0 + \sum_{n=1}^{\infty}\left(a_n e^{in\psi} + a_n^{*} e^{-in\psi}\right), \tag{24}$$

we have

$$G(\zeta) = A\{g(\psi)\} = a_0 + 2\sum_{n=1}^{\infty} a_n \zeta^n. \tag{25}$$

Noticing the symmetry of the finger and substituting Eqs. (22) and (23) into Bensimon's equation (5), we get the nonlinear partial differential equation

$$\partial_t z = -\Bigl[\,2\sum_{n=1}^{\infty} a_n \sin n\psi + i\,g(\psi)\Bigr]\partial_\psi z, \tag{26}$$

where

$$a_n = \frac{1}{\pi}\int_0^{\pi} g(\psi)\cos n\psi\,d\psi. \tag{27}$$

Figure 2. Stable evolution of the nonsingular Saffman-Taylor (NST) finger for different values of $\lambda$, with $\varepsilon_0 = 0.3$.

If the initial condition $z(\psi, 0) = x(\psi, 0) + iy(\psi, 0)$ is given, we can integrate Eq. (26) for various $\lambda$ and obtain a family of evolutions. Fig. 2 shows the evolution of the nonsingular Saffman-Taylor finger. It can be seen from the figure that the above conclusions remain valid in the long run of the evolution.
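One evaluation of the right-hand side of the evolution equation can be sketched as follows. The Python code below samples the initial nonsingular finger $z(\psi, 0) = 2(1-\lambda)\ln\tfrac{1}{2}(e^{-i\psi}+1+\varepsilon_0) + i\psi$, computes $g = 1/|\partial_\psi z|^2$ by central differences, obtains the cosine coefficients $a_n$ by the trapezoid rule, and assembles $-[2\sum_n a_n \sin n\psi + i g]\,\partial_\psi z$. The grid size, the number of modes, the finite-difference scheme and all function names are assumptions made for illustration; this is not the authors' error-controlled algorithm.

```python
import cmath
import math

def nst_interface(psi, lam, eps0):
    """Initial nonsingular Saffman-Taylor interface z(psi, 0)."""
    return (2.0 * (1.0 - lam)
            * cmath.log(0.5 * (cmath.exp(-1j * psi) + 1.0 + eps0)) + 1j * psi)

def g_closed_form(psi, lam, eps0):
    """Closed-form g(psi) for the initial interface, used only as a check."""
    c, e = 1.0 - 2.0 * lam, 1.0 + eps0
    num = 1.0 + 2.0 * e * math.cos(psi) + e * e
    den = c * c - 2.0 * c * e * math.cos(psi) + e * e
    return num / den

def cosine_coefficients(g, h, n_modes):
    """a_n = (1/pi) * integral_0^pi g(psi) cos(n psi) dpsi by the trapezoid rule."""
    m = len(g) - 1
    coeffs = []
    for n in range(n_modes + 1):
        s = 0.5 * (g[0] + g[m] * math.cos(n * math.pi))
        s += sum(g[k] * math.cos(n * k * h) for k in range(1, m))
        coeffs.append(s * h / math.pi)
    return coeffs

def bensimon_rhs(z_ext, h, n_modes):
    """Return (g, a, dz/dt) on the nodes psi_k = k*h, k = 0..M.

    z_ext holds z at psi = -h, 0, h, ..., pi, pi + h so that central
    differences are available at both end nodes.
    """
    dz = [(z_ext[k + 2] - z_ext[k]) / (2.0 * h) for k in range(len(z_ext) - 2)]
    g = [1.0 / abs(d) ** 2 for d in dz]
    a = cosine_coefficients(g, h, n_modes)
    dzdt = []
    for k, (gk, dk) in enumerate(zip(g, dz)):
        s = 2.0 * sum(a[n] * math.sin(n * k * h) for n in range(1, n_modes + 1))
        dzdt.append(-(s + 1j * gk) * dk)
    return g, a, dzdt

# Illustrative parameters (assumed): lambda = 0.3, eps0 = 0.3, 400 intervals on [0, pi].
lam, eps0, M = 0.3, 0.3, 400
h = math.pi / M
z_ext = [nst_interface(k * h, lam, eps0) for k in range(-1, M + 2)]
g, a, dzdt = bensimon_rhs(z_ext, h, n_modes=40)
```

A time-stepping loop would repeat this evaluation after updating $z$; as the text notes, round-off and truncation errors in the Fourier sums grow quickly, so a naive explicit loop built on this sketch should not be expected to remain accurate for long times.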


4  Conclusions  and  Prospect 

Using the conformal mapping method presented by Bensimon, we present a nonsingular Saffman-Taylor finger. Both theoretical and numerical solutions verify the pattern selection: in the absence of surface tension, the finger always tends to the $\frac{1}{2}$-width finger in the long run, which resolves the Saffman-Taylor puzzle. Being a simple example of pattern selection with a real experimental background, this work contradicts the generally accepted belief that surface tension is indispensable for the selection of the $\frac{1}{2}$-width finger, a result parallel to that of ref. 7.
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Figure  3.  Straight  multiple  fingers  (left)  and  round  fingers  (right) 


Furthermore, the solution (2) for a single finite finger can be generalized to the case of multiple fingers,

$$z = \frac{2(1-\lambda)}{n}\ln\tfrac{1}{2}\left[e^{-n\Phi + i(n-1)\pi} + 1 + \varepsilon\right] + \Phi, \tag{28}$$

and of round fingers,

$$\ln z = \frac{2(1-\lambda)}{n}\ln\tfrac{1}{2}\left[e^{-n\Phi + i(n-1)\pi} + 1 + \varepsilon\right] + \Phi \tag{29}$$

(see Figs. 3 and 4, respectively), which may produce more interesting pattern selection problems of evolution, competition and ramification of multiple fingers in the straight channel as well as in circular disc geometry.

It should be mentioned that Bensimon's equation is a highly nonlinear equation that poses computational difficulties. Tiny round-off errors, especially the residual errors in summing the Fourier series in Eq. (26), are magnified quickly. Special efforts have been made to reduce these errors to an extremely low level in order to make the computation possible; this has been completed [10]. If surface tension is present, Bensimon's equation can be computed in the same way, but a special method of numerical computation must be designed.

Pattern selection of the nonsingular Saffman-Taylor finger in the presence of surface tension, and of multiple fingers, constitutes our further work.
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Generally,  the  irregular  behaviour  of  a  function  f(x)  is  described  by  its  Holder 
exponent $h(x)$, which quantifies the strength of the singularity at a point $x$. In most cases, where the function does not contain oscillating singularities, the Wavelet Transform Modulus Maxima (WTMM) method allows a reliable estimation [1,2] of the singularity spectrum $D(h)$, i.e. the Hausdorff dimension of the set of all points $x$ with the same Hölder exponent $h(x) = h$. Methods based on the WTMM should be numerically more stable than direct methods such as the Structure Function method [1], since they involve only weighted-average quantities rather than averages of function increments, and since small errors are relatively less important if the calculation is restricted to maxima.

However, in the presence of oscillating singularities the standard WTMM method gives irrelevant information on the Hölder regularity of the function. In general, two exponents $h$, $\beta$ are necessary to describe the singular behaviour of a function $f(x)$, namely the Hölder exponent $h$ and the oscillation exponent $\beta$ describing the local power-law divergence of the instantaneous frequency [3,4]. If $f(x)$ contains oscillating singularities, the regularity of the primitive of $f(x)$ depends on $\beta$. In this case the Hölder exponent does not increase by 1, as in the case of a cusp singularity, but by $\beta + 1$. Thus, the singularity spectrum of general functions depends on both exponents, $D(h, \beta)$.

In order to extract Hölder exponents and at the same time quantify the oscillating behaviour, we propose to use a family of complex progressive wavelets [5] $\{\psi_n\}$,

$$\psi_0(x) = \left(e^{i\omega_0 x} - e^{-\omega_0^2/2}\right)e^{-x^2/2}, \qquad \psi_n(x) = \frac{d}{dx}\psi_{n-1}(x) \quad (n \in \mathbb{N}), \tag{1}$$

with an increasing number of vanishing moments

$$\int_{-\infty}^{+\infty} x^k \psi_n(x)\,dx = 0 \quad \text{for } 0 \le k \le n. \tag{2}$$

For a sufficiently large frequency $\omega_0$ ($\omega_0 > 5$ for practical purposes), $\psi_0$ can be replaced by the Morlet wavelet.
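The vanishing-moment property can be illustrated numerically. The Python sketch below assumes the wavelet family reads $\psi_0(x) = (e^{i\omega_0 x} - e^{-\omega_0^2/2})e^{-x^2/2}$ with $\psi_1 = \psi_0'$, uses $\omega_0 = 5$, and estimates the moments with a plain trapezoid rule on a truncated interval; the function names, the truncation at $|x| \le 12$ and the step size are illustrative choices.

```python
import cmath
import math

OMEGA0 = 5.0
K = math.exp(-0.5 * OMEGA0 ** 2)   # correction term that enforces zero mean

def psi0(x):
    """Assumed form of the zeroth wavelet of the family."""
    return (cmath.exp(1j * OMEGA0 * x) - K) * math.exp(-0.5 * x * x)

def psi1(x):
    """Analytic derivative of psi0 with respect to x."""
    osc = cmath.exp(1j * OMEGA0 * x)
    return (1j * OMEGA0 * osc - x * (osc - K)) * math.exp(-0.5 * x * x)

def moment(psi, k, half_width=12.0, step=0.01):
    """Trapezoid-rule estimate of the integral of x**k * psi(x) over the real line."""
    n = int(round(2 * half_width / step))
    total = 0.0
    for j in range(n + 1):
        x = -half_width + j * step
        weight = 0.5 if j in (0, n) else 1.0
        total += weight * (x ** k) * psi(x)
    return total * step

for name, psi, k in (("psi0", psi0, 0), ("psi0", psi0, 1),
                     ("psi1", psi1, 0), ("psi1", psi1, 1)):
    print(name, "moment", k, "magnitude:", abs(moment(psi, k)))
```

In this sketch the zeroth moment of `psi0` and the zeroth and first moments of `psi1` vanish up to round-off, whereas the first moment of `psi0` is small but nonzero, which is consistent with each derivative adding one vanishing moment.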

Using these wavelets, we follow ideas similar to those of an earlier paper [2], where an algorithm was developed for a family of real-valued wavelets based on derivatives of the Gaussian distribution, which allows a direct tracing of the skeleton of wavelet transform maxima lines [2]. Using the following abbreviation for the continuous wavelet transform,

$$W_n f := W_n f(a, b) = \frac{1}{a}\int_{-\infty}^{+\infty} f(x)\,\overline{\psi_n}\!\left(\frac{x-b}{a}\right)dx \quad (a, b \in \mathbb{R},\ a > 0), \tag{3}$$

where $a$ denotes the scale, $b$ the shift parameter, and $\overline{\psi_n}$ the complex conjugate of $\psi_n$, we set up a partial differential equation for the wavelet transform:

$$\left(a\frac{\partial^2}{\partial b^2} - \frac{\partial}{\partial a} - i\omega_0\frac{\partial}{\partial b} + \frac{n}{a}\right)W_n f = 0. \tag{4}$$

Describing  the  maxima  line  in  a  parametric  form  {a(t),  b(t)},  the  motion  along  this 
line  is  given  by  two  ordinary  differential  equations 


with  a  constant  C,  which  can  be  integrated  numerically.  Similar  equations  can  be 
derived  for  the  ridges  5.  The  skeleton  of  maxima  lines  can  be  continuously  traced 
up  to  the  desired  accuracy  with  a  reduced  computational  effort.  Based  on  these 
maxima lines a partition function $Z(p, q, a)$ is defined,

$$Z(p, q, a) = \sum_{i \in \text{max. lines}} \Bigl(\sup_{a' < a} |W_n f(a', b_i)|\Bigr)^{q} d_i(a)^{p}, \tag{6}$$

where $d_i(a)$ denotes the distance between the $i$-th and the $(i+1)$-th maxima line. In comparison with classical methods, the main advantage of the proposed direct tracing algorithm is that the supremum along each maxima line can be easily evaluated.

Exploiting the scaling behaviour of the partition function, $Z(p, q, a) \sim a^{\tau(p,q)}$, and applying the Legendre transform to the exponent $\tau(p, q)$, one obtains the two-dimensional singularity spectrum $D(h, \beta)$ [3]:

$$D(h, \beta) = \min_{p,q}\bigl(qh + p(\beta + 1) - \tau(p, q)\bigr). \tag{7}$$

In practice, due to the finite length of and noise in data sets, the accuracy of the Legendre transform might decrease. In order to avoid this problem, we propose a generalization of the canonical method of Chhabra and Jensen [6] to two-dimensional spectra, which allows a direct computation of the singularity spectrum $D(h, \beta)$.
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Intracavity  doubled  solid  state  lasers  based  on  Nd-doped  crystals  are  efficient 
and compact sources of coherent visible optical radiation. When such lasers operate in three or more longitudinal cavity modes, irregular fluctuations of the output intensity may occur. This behavior, referred to as the green problem, was first reported by Baer [1]. He found that these instabilities arise from a coupling of the longitudinal modes of the laser by sum-frequency generation, which occurs in the intracavity-doubling crystal. When the laser does not contain the nonlinear crystal, or when it operates in a single longitudinal mode, its output is stable. In the case of two oscillating longitudinal modes, the output intensity of the laser is stable only for small values of nonlinearity; otherwise both modes tend to pulse on and off out of phase with each other [1]. When the number of lasing modes is larger than two, the laser can exhibit, depending on its parameters, various behaviors such as antiphase dynamics [2,3], clustering [1], grouping [4] and chaotic dynamics [5].

Key words: Chaos in laser cavities; Multimode, intracavity-doubled solid-state lasers; Baer-type rate equations

The main goal of this communication is to study the possibilities of stabilizing large-amplitude fluctuations in such a laser, i.e., an intracavity-doubled Nd:YAG laser. The analysis is based on the Baer-type rate equations [1]

$$\tau_c \frac{dI(p,t)}{dt} = I(p,t)\Bigl(-\alpha_p + G(p,t) - \varepsilon I(p,t) - 2\varepsilon\sum_{q \neq p} I(q,t)\Bigr), \tag{1a}$$

$$\tau_f \frac{dG(p,t)}{dt} = G_{ap} - G(p,t)\Bigl(1 + \beta(p,p)I(p,t) + \sum_{q \neq p}\beta(p,q)I(q,t)\Bigr), \tag{1b}$$

$$p = 1, \ldots, N,$$

where $N$ is the number of longitudinal modes; $\tau_c$ and $\tau_f$ are the cavity round-trip time and the fluorescence lifetime of the Nd$^{3+}$ ion, respectively; $I(p,t)$ and $G(p,t)$ are, respectively, the intensity and gain associated with the $p$-th longitudinal mode; $\alpha_p$ is the cavity loss parameter for the $p$-th mode; $G_{ap}$ is the small-signal gain; $\beta(p,p)$ is the self-saturation coefficient in the active medium; and $\beta(p,q)$ is the parameter describing the cross-saturation between two longitudinal modes, $p$ and $q$. The parameter $\varepsilon$ is a nonlinear coefficient whose value depends on the properties of the nonlinear crystal; it describes the conversion efficiency of the fundamental intensity into the doubled intensity. The terms $\varepsilon I(p,t)^2$ and $\varepsilon I(p,t)I(q,t)$ in Eq. (1a) account for the loss in the intensity of the fundamental frequency through second harmonic generation and through sum-frequency generation, respectively, and they provide a nonlinear loss mechanism that globally couples the longitudinal modes, i.e., each lasing mode is coupled to all other lasing modes [2]. A comparable
amount  of  global  coupling  occurring  in  the  set  of  Eq.  (1)  is  introduced  by  the  cross¬ 
saturation  coefficient,  /3IkGk .  We  use  the  approximation  that  the  cross-saturation 
coefficient  is  constant  for  all  modes,  /3(p,  q)  =  (3,  where  /3  =  §/?0  or  j3  =  |/30,  where 
A>  =  0.06  is  a  scalling  parameter.  We  assume  also  that  the  losses  and  the  small 
signal gains are the same for all modes, i.e., $\alpha_p = \alpha_q = \alpha$, $G_{ap} = G_{aq} = G_a$, where $p, q = 1, \ldots, N$. The other parameters describing the system are chosen so that they describe a real experimental configuration of the laser, i.e., $\tau_c = 10$ ns, $\tau_f = 0.24$ ms, $\alpha = 0.015$, $\gamma = 0.12$. The number of longitudinal modes, $N = 1, \ldots, 250$, and the strength of nonlinearity, $\varepsilon = 10^{-7}$ to $10^{-3}$, are not fixed; they vary in the analysis.
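To make the structure of the rate equations concrete, the following Python sketch evaluates the right-hand sides $I_p(-\alpha + G_p - \varepsilon I_p - 2\varepsilon\sum_{q\neq p} I_q)$ and $G_a - G_p(1 + \beta_{\mathrm{self}} I_p + \beta_{\mathrm{cross}}\sum_{q\neq p} I_q)$ for modes with equal losses and gains, and computes the nonzero single-mode steady state in closed form. The function names, the choice $G_a = 0.12$ (reading $\gamma$ as the small-signal gain), and the use of $0.06$ as the self-saturation coefficient are assumptions made purely for illustration.

```python
import math

def baer_rhs(I, G, alpha, G_a, eps, beta_self, beta_cross):
    """Right-hand sides tau_c*dI/dt and tau_f*dG/dt for every mode.

    I and G are lists of mode intensities and gains; losses and small-signal
    gains are taken equal for all modes.
    """
    total = sum(I)
    dI, dG = [], []
    for Ip, Gp in zip(I, G):
        others = total - Ip                      # sum over q != p
        dI.append(Ip * (-alpha + Gp - eps * Ip - 2.0 * eps * others))
        dG.append(G_a - Gp * (1.0 + beta_self * Ip + beta_cross * others))
    return dI, dG

def single_mode_steady_state(alpha, G_a, eps, beta_self):
    """Nonzero steady state for N = 1 (requires eps > 0 and beta_self > 0).

    Setting both right-hand sides to zero gives G = alpha + eps*I and
    G*(1 + beta_self*I) = G_a, i.e. a quadratic equation for I.
    """
    a = eps * beta_self
    b = eps + alpha * beta_self
    c = alpha - G_a
    I = (-b + math.sqrt(b * b - 4.0 * a * c)) / (2.0 * a)
    return I, alpha + eps * I

# Illustrative values (assumed): alpha = 0.015, G_a = 0.12, eps = 1e-5, beta = 0.06.
I_ss, G_ss = single_mode_steady_state(0.015, 0.12, 1e-5, 0.06)
dI, dG = baer_rhs([I_ss], [G_ss], 0.015, 0.12, 1e-5, 0.06, 0.06)
print("steady state:", I_ss, G_ss, "residuals:", dI[0], dG[0])
```

For more than one mode the same `baer_rhs` function can be passed to any stiff ODE integrator; the large ratio $\tau_f/\tau_c$ makes explicit schemes with a fixed step impractical.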

First, we analyze numerically the stabilization of the laser radiation by an increase of the number of longitudinal modes, as proposed in [2]. We observe that the theoretically obtained [3] linear dependence of the minimal number of modes needed to stabilize the laser output on the strength of nonlinearity agrees with the numerical solutions only for sufficiently small nonlinearity. For large nonlinearity the minimal number of modes obtained from the numerical simulations is larger than the number that follows from the theoretical predictions. This is caused by a strong cancellation of modes during the evolution. For very large nonlinearity this cancellation is so strong that only a few modes remain (even when there are initially 250 oscillating modes). Therefore, a large number of simultaneously oscillating longitudinal modes cannot be achieved in this case.

However,  the  problem  of  the  stabilization  of  the  laser  output  can  be  solved  in 
another  way,  namely,  by  an  increase  of  the  strength  of  nonlinearity,  which  leads 
to  very  strong  competition  between  the  modes,  so  that  during  the  evolution  all  of 
them,  besides  a  single  one,  are  canceled.  As  a  consequence,  a  steady-state  solution, 
which  is  stable  against  small  perturbations,  arises. 

This way of stabilization, achieved by forcing the laser to operate in a one-mode regime, is similar to other approaches presented in the literature, where the stabilization is obtained by inserting an additional element into the laser cavity, for example an etalon [1] or a birefringent crystal. However, the proposed method seems to be a better solution, since no additional element is needed and the output intensity of the doubled frequency is larger (it increases with increasing strength of nonlinearity).

References 

1.  T.  Baer,  J.  Opt.  Soc.  Am.  B  3,  1175  (1986). 

2.  G. E. James et al., Opt. Lett. 15, 1141 (1990).

3.  J.-Y.  Wang  and  P.  Mandel,  Phys.  Rev.  A  48,  671  (1993). 

4.  K. Otsuka, Y. Sato, and J. L. Chern, Phys. Rev. E 56, 4765 (1997).

5.  M.  E.  Pietrzyk  and  M.  B.  Danailov,  preprint  of  the  ICTP  (the  Abdus  Salam 
International  Centre  for  Theoretical  Physics),  Trieste,  Italy,  IC/98/235  . 




CHAOTIC  DYNAMICS  OF  ELASTIC-PLASTIC  BEAMS 


ULO  LEPIK 

Institute  of  Applied  Mathematics,  Tartu  University,  Vanemuise  46,  51014  Tartu,  Estonia 
Phone:  (3727)375868,  Fax:  (3727)375862,  E-mail:  ylepik@ut.ee 

During the last decades great attention has been paid to chaotic vibrations of elastic beams, but not much has been done in the case of elastic-plastic deformations. Evidently the first paper in this field is due to Symonds and Yu (1985), who considered the following problem. A fixed-ended beam is subjected to a short, intense pulse of transverse loading that produces plastic deformation. Since the ends of the beam are fixed, membrane forces must be taken into account. Solving the equations of motion, Symonds and Yu found that the permanent deflection may be in the direction opposite to the load. This phenomenon was investigated in several papers by Symonds and his collaborators. It turned out that the permanent deflection is very sensitive to small changes of load. Fractality and self-similarity, which are characteristic of chaotic processes, were demonstrated. For the similarity dimension the value 0.78, and for the correlation fractal dimension approximately 1.44, were obtained.

In most papers of Symonds and his collaborators the real beam was replaced by a Shanley-type model with one or two degrees of freedom (DoF). In some papers (see Symonds and Qian, 1996) the Galerkin method for 2 DoF was used; some simplifying assumptions (sandwich beam, perfectly plastic material, disregard of axial inertia forces) were made.

The aim of this paper is to present a method of solution that is based on the Galerkin technique and is applicable to beams with an arbitrary number of DoF. Basic elements of this method were worked out by Lepik (1994, 1995).

For brevity we shall consider beams with a rectangular cross-section; B, h and L are the width, thickness and length of the beam, respectively. The equations of motion are


■  Mr* 

dx  {  dx 


p(x,t)-pBh — £ 
Ot 


where $\rho$ is the density, $u$ the axial displacement and $w$ the deflection. The axial force $T$ and the bending moment $M$ are calculated from the formulae

$$T = \int_{-h/2}^{h/2}\sigma(x, z)\,dz, \qquad M = \int_{-h/2}^{h/2}\sigma(x, z)\,z\,dz. \tag{2}$$


We  shall  assume  that  the  beam  material  has  linear  strain-hardening;  elastic  unloading 
and  secondary  plastic  loading  effects  are  also  taken  into  account. 

To the equations (1) we shall apply the Galerkin procedure

$$\int_0^L \Phi_1\,\delta u\,dx = 0, \qquad \int_0^L \Phi_2\,\delta w\,dx = 0. \tag{3}$$

In  the  case  of  a  beam  with  simply  supported  ends  we  shall  seek  the  solution  in  the 
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u  -  er 


l-y)+  Y,ak  sinA:^-— 
v  LJ  *=1  L 


w=  ZfiSintcff—, 
*= l  L 


(4) 


where $e(0) = e_0$ and the coefficients $e_0$, $a_k$, $f_k$ are subject to variation. The integrals (2) are calculated numerically; substituting these results into equations (3), we find the second derivatives $\ddot e_0$, $\ddot a_k$, $\ddot f_k$. The quantities $e_0$, $a_k$, $f_k$ are then evaluated by the method of central finite differences (to obtain a stable solution the time increment $\Delta t$ must be sufficiently small).
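The role of the time increment in a central-difference scheme can be illustrated on a much simpler problem than the beam equations. The Python sketch below integrates a single generalized coordinate obeying $\ddot q = F(q)$ with the explicit update $q_{k+1} = 2q_k - q_{k-1} + \Delta t^2 F(q_k)$; the linear oscillator used as the test problem, the start-up step and the function name are assumptions for illustration and do not reproduce the elastic-plastic model.

```python
import math

def central_difference(accel, q0, v0, dt, n_steps):
    """Integrate q'' = accel(q) with the explicit central-difference scheme.

    Returns the list [q(0), q(dt), ..., q(n_steps*dt)].
    """
    q_prev = q0
    # Start-up step from a second-order Taylor expansion.
    q = q0 + dt * v0 + 0.5 * dt * dt * accel(q0)
    history = [q_prev, q]
    for _ in range(n_steps - 1):
        q_next = 2.0 * q - q_prev + dt * dt * accel(q)
        q_prev, q = q, q_next
        history.append(q)
    return history

def oscillator(q):
    """Unit-frequency linear oscillator, q'' = -q (stand-in test problem)."""
    return -q

stable = central_difference(oscillator, q0=1.0, v0=0.0, dt=0.01, n_steps=628)
unstable = central_difference(oscillator, q0=1.0, v0=0.0, dt=2.5, n_steps=20)
print("small step, final value:", stable[-1], "reference:", math.cos(6.28))
print("large step, final magnitude:", abs(unstable[-1]))
```

For the linear oscillator with angular frequency $\omega$ the scheme is stable only for $\Delta t < 2/\omega$, so the first run stays close to $\cos t$ while the second grows without bound; in a multi-mode Galerkin model the highest retained frequency would set the corresponding limit.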

In order to demonstrate the efficiency of the proposed algorithm, several numerical examples are examined. In order to establish chaotic effects, the deflection history, phase and power spectrum diagrams are put together. It turned out that chaotic motion of the beam may exist, especially in the initial phase of motion; in the long term the motion passes into periodic vibrations of smaller amplitude.
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Riemann’s  example  of  a  continuous,  non- differentiable  function  is  given  by  the  sum 
$\sum_{n=1}^{\infty}\sin(n^2 x)/n^2$. This function is sufficiently irregular and its graph is fractal. Hardy proved [1] that Riemann's non-differentiable function is not differentiable at any irrational point because of a square root singularity at these points. A careful investigation of the differentiability of Riemann's non-differentiable function was carried out by Gerver, who showed [2] that this function has derivative equal to $-1/2$ at every rational point of a special type (those forming the orbit of the point 1 under the theta-modular group [3]). Different proofs of this surprising fact were given by other authors [3,4,5], which also revealed a close relation between Riemann's non-differentiable function, the classical $\theta$-function and Gauss sums. Duistermaat obtained [3] exact functional equations for this function under transformations of the theta-modular group. In this article we use functional equations for Riemann's non-differentiable function under theta-modular transformations to derive functional equations on Gauss sums generalizing the Genocci-Schaar identity.

A Gauss sum is a sum of the form

$$S(p, q) = \sum_{n=0}^{q-1}\exp(\pi i n^2 p/q),$$

where $p$ and $q$ are relatively prime integers of opposite parity, i.e. one is odd and the other is even. The Genocci-Schaar identity on Gauss sums is the following: for positive integers $p$ and $q$ of opposite parity,

$$\frac{1}{\sqrt{q}}\sum_{n=0}^{q-1}\exp(\pi i n^2 p/q) = \frac{\exp(\pi i/4)}{\sqrt{p}}\sum_{n=0}^{p-1}\exp(-\pi i n^2 q/p).$$

This identity can be interpreted as the transformation of the Gauss sum under the change $\sigma: z \to -1/z$, where $z = p/q$.
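The reciprocity identity above is easy to check numerically for small arguments. The following Python sketch evaluates both sides, $\frac{1}{\sqrt q}\sum_{n=0}^{q-1} e^{\pi i n^2 p/q}$ and $\frac{e^{\pi i/4}}{\sqrt p}\sum_{n=0}^{p-1} e^{-\pi i n^2 q/p}$, for a few coprime pairs of opposite parity; the function names and the chosen pairs are illustrative assumptions.

```python
import cmath
import math

def gauss_sum(p, q):
    """S(p, q) = sum_{n=0}^{q-1} exp(pi*i*n^2*p/q) for an integer q > 0.

    The exponent is reduced modulo 2q in exact integer arithmetic before the
    floating-point evaluation, which keeps the phase accurate.
    """
    total = 0j
    for n in range(q):
        k = (n * n * p) % (2 * q)
        total += cmath.exp(1j * math.pi * k / q)
    return total

def reciprocity_sides(p, q):
    """Left- and right-hand sides of the reciprocity identity for p, q > 0."""
    lhs = gauss_sum(p, q) / math.sqrt(q)
    rhs = cmath.exp(1j * math.pi / 4) * gauss_sum(-q, p) / math.sqrt(p)
    return lhs, rhs

for p, q in ((1, 2), (3, 2), (3, 4), (2, 5), (5, 8), (7, 12)):
    lhs, rhs = reciprocity_sides(p, q)
    print(p, q, abs(lhs - rhs))
```

The printed differences are at the level of floating-point round-off for these pairs, in agreement with the identity.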

The modular group $\Gamma$ is the group of fractional linear transformations $\gamma: z \to (az + b)/(cz + d)$, where $a, b, c, d \in \mathbb{Z}$ and $ad - bc = 1$. The theta-modular group $\Gamma_\theta$ is the subgroup of the modular group generated by the mappings $\tau: z \to z + 2$ and $\sigma: z \to -1/z$. For any element of the theta-modular group the following are valid: $ab \equiv 0 \pmod 2$ and $cd \equiv 0 \pmod 2$. Every fractional transformation $\gamma \in \Gamma_\theta$ has a simple pole at the point $z = -d/c$. The rational points $x = p/q$ with integers $p$, $q$ of opposite parity, together with infinity, constitute the theta-modular orbit of the point 0.

Riemann's non-differentiable function has the following local estimate at the point $x$: $f(x \pm \varepsilon) = f(x) + R_{\pm}\varepsilon^{\delta}$, where $0 < \delta < 1$. Using the functional equations for $f(x)$, it is possible to find functional equations for the functions $R_{\pm}$. These functions are known only at the rational points, where they coincide with the Gauss sums. The following theorem describes these functional equations:
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Theorem:  Let  r  and  s  be  integers  of  which  one  is  even  and  the  other  is  odd, 
and $s$ is positive; let $\gamma \in \Gamma_\theta$ be an element of the theta-modular group, $\gamma: z \to (az + b)/(cz + d)$, and suppose $c$ is positive and $cr + ds \neq 0$. Define $r' = ar + bs$ and $s' = cr + ds$. Then the following formula for Gauss sums is valid:


exp(7 HSign(s  )/4) 


s(r  ’  S  )  2fc  c)  =  71  5(r’ s) 


(1) 


Proof:  The  proof  is  based  on  the  differentiability  properties  of  Riemann’s 
non-differentiable  function.  Let  us  consider  the  following  function  = 

J2n=iexP(™n2z)/7Tin2.  Using  technique  of  papers4’5  we  calculate  the  following 
estimation  for  the  function  <fi(z)  at  the  point  x  =  u/v,  where  u  and  v  are  relatively 
prime  integers  of  opposite  parity,  ( u ,  v)  =  1  and  uv  =  0  ( mod  2) : 


+00 

<f){x  +  h2)  -  4>(x  -  h 2)  =  h2  expirin'2 x)ip(nh)  —  h2  =  (2) 

n~  —  00 


|v| — 1  +00 

h2  22  exp  (7 vit2x)  22  +  th )  -  h2  =  2 1/2 S(u,  v)h/\v\  +  0(h2). 

t= 0  k  —  —  co 

Here  ip(x)  =  sm^x2) / %x2  if  x  /  0  and  <p(x)  =  1  if  x  =  0,  and  we  write  n  = 
k\v\  +  t,  (0  <  t  <  |v|)  and  use  that  exp(7r in2x)  =  exp(7r^2x),  since  uv  =  0  ( mod  2). 
The  function  <j>(z)  obey  a  functional  equation  under  the  action  of  theta-modular 
group3.  Let  7  G  T#  be  an  element  of  theta  modular  group,  then  the  function 
ip(z)  —  (/>(z)  -  7  (z)~1py(z)<p(j(z))  is  differentiable,  and  analytical  function  /i7  is 
given  by: 

P-rto  =  e^c~\z  +  d/c)-^2S(-d ,  c).  (3) 

Eq.  (2)  is  valid  for  any  point  of  theta- modular  orbit  of  x  =  r/s,  except  infinity. 
Calculating  estimation  (2)  for  the  differentiable  function  ^(x)  at  the  point  x  =  r/s 
and  supposing  7 (r/s)  ±  00,  we  find  that 

^  - 1  W-Mr/.) S(ar  ±  ^  J ,+  ^  -  0.  (4) 

5  |cr  +  ds\ 

Note that $r' = ar + bs$ and $s' = cr + ds$ are relatively prime integers. Substituting the expression for $\mu_\gamma(z)$ in terms of Gauss sums (Eq. 3), we obtain the desired result.

Q.E.D.

The Genocci-Schaar identity corresponds to the case $a = 0$, $b = -1$, $c = 1$ and $d = 0$. The functional equation (Eq. 1) describes the theta-modular transformations of Gauss sums and can be used to derive the values of the Gauss sums.
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The  purpose  of  the  paper  is  to  introduce  a  new  application  of  the  multifractals  in  the 
reliability engineering, risk analysis, historical chronology and other fields where we deal with series of events of various natures. The spectrum of such series is quite broad. In particular, we do not know whether some temporal pattern is hidden in an apparently disordered set of events. Multifractal theory is a good basis for revealing such an order and describing the event sequence in time. It can provide a deeper understanding of the nature of the event flow.

A mathematical construction that represents an event sequence as a set of random points on the time axis is referred to as a stochastic point process. It can be modeled either as a list of impulses located at the times where events occur or as a counting process. Let $S$ be a sample of events of limited size $N_0$ during the specified period of time $[0, \tau_{max}]$. The time history of the process for the sample $S$ is represented by a sequence of idealized impulses of vanishing width, located at the event times $\tau_i$, $i = 1, 2, \ldots, N_0$. Further, let us rescale the time $\tau_i$ of every $i$-th member of the sample $S$ by the maximum value $\tau_{max}$, $t_i = \tau_i/\tau_{max}$, so that we can consider the event distribution on the unit interval of time $T = [0, 1]$. In order to characterize this distribution we divide the unit interval into temporal subintervals of duration $\Delta t = 2^{-n}$. Thus $N = 2^n$ subintervals are needed to cover the interval $T$, where $n$ is the number of generations in the binary subdivision of the temporal interval $T$. The distribution of the sample population over the temporal interval is specified by the numbers $N_j$ of members of the sample $S$ in the $j$-th subinterval. We use the fraction of the total population, $\mu_j = N_j/N_0$, as a probabilistic measure for the content of subinterval $\Delta t_j$. The set of such measures presents a complete description of the distribution of events on the unit temporal interval $T$ at the stated resolution $\Delta t$ [3].

Now let us consider a case that satisfies the Bernoulli trial conditions. In our interpretation,
the event of interest is a failure occurring in the first half of a temporal interval,
with probability $p$. A series of Bernoulli trials with parameter $p$ is a sequence of independent
trials in which there are only two outcomes, and the probability $p$ remains the same
for all generations of the binary subdivision of the interval $T$. Three cases, associated
with typical forms of the cumulative distribution function, are considered: a process with early
events ($p > 0.5$), a Poisson process ($p = 0.5$), and a process with late events ($p < 0.5$). In the
case of Bernoulli trials the measure $\mu$ is generated recursively by a multiplicative
binomial process. The binomial cascade provides an example of a probability measure that has
a rich asymptotic structure and is, in modern terms, multifractal [3]. The connection between
the parameter $p$ of the Bernoulli trials and the multifractal spectrum is considered.
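
As a hedged illustration of the multiplicative binomial process, the sketch below generates the measure after $n$ generations and evaluates the mass exponent of the binomial measure, $\tau(q) = -\log_2\left(p^q + (1-p)^q\right)$, under the convention $\sum_j \mu_j^q = (\Delta t)^{\tau(q)}$. The function names and the chosen value $p = 0.7$ are assumptions made for this example; the sign convention for $\tau(q)$ differs between authors.

```python
import numpy as np

def binomial_cascade(p, n):
    """Measures of the 2**n dyadic subintervals after n generations."""
    mu = np.array([1.0])
    for _ in range(n):
        # Every interval passes a fraction p to its first half
        # and 1 - p to its second half.
        mu = np.column_stack((p * mu, (1.0 - p) * mu)).ravel()
    return mu

def tau_q(p, q):
    """Mass exponent, with sum_j mu_j**q = (2**-n)**tau(q)."""
    return -np.log2(p ** q + (1.0 - p) ** q)

# Illustrative parameter: p > 0.5 corresponds to the "early events" case.
mu2 = binomial_cascade(0.7, 2)
print(mu2)
print(tau_q(0.7, 2.0))
```

For $p = 0.5$ the cascade gives the uniform measure and $\tau(q) = q - 1$ is linear in $q$; any other $p$ makes $\tau(q)$ nonlinear, which is the signature of multifractality.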

To validate the technique, a computer simulation study has been carried out. As
a first step, we performed a multiscale analysis of data generated by Bernoulli trials.
The binomial multiplicative process produces shorter and shorter temporal subintervals $\Delta t$
that contain smaller and smaller fractions of the total measure, and generates a multifractal
probabilistic measure supported by a Cantor set on the unit temporal interval.

In order to verify the fractality of data sets obtained from numerical simulations, tests
and inspections of real devices, a wavelet analysis was carried out. The analysis shows
that the maxima lines of the continuous wavelet transform (CWT) coefficients converge
towards the singularities of the measure and reproduce its hierarchical structure [1].
The successive forkings occurring at different scales reveal the multifractal nature of the
measure. Failure occurrence is a probabilistic process, which results in the formation of
self-similar, or rather self-affine, temporal clusters. Overwhelming evidence from computer
simulation of different measures indicates that these patterns are self-affine fractals,
meaning that their complexity is the same at different scales of observation.

The wavelet analysis was carried out using the WaveLab package [2]. The
multifractal structures proposed in the sequence of events are real-time structures, in contrast
to fractal attractors, which reside in phase space. Thus the wavelet transform can be
applied directly to a series of statistical reliability data obtained from experiments and
inspections of the technical state under real service conditions [4]. The graph of the CWT
coefficients of the failure time history shows that the successive forkings produce a
multifractal temporal structure. Increasing the magnification of the wavelet transform
microscope progressively reveals the successive generations of branching. The symmetry
of the graph is broken by the non-uniformity of the probabilistic measure. Let $N(a)$ be the
number of maxima lines in the CWT skeleton at the scale $a$. In the limit, as the scale $a$
tends to 0, the ratio $\ln N(a)/\ln a$ is associated with the exponent $\alpha$ [1]. The concentration of
data points around a straight line in the plot of $\ln N(a)$ versus $\ln a$ can be
regarded as a quantitative indication of the self-similarity of the event sequence in real
data sets. The ratio between the time scales of successive generations can take different
values, which is another indication that a multifractal description is appropriate. Wavelet
analysis of empirical reliability data provides probabilistic evidence for the existence
of a multiplicative process hidden in the temporal ordering of the failure sequence.

The plots of the empirical cumulative distribution function of the lifetime data have an
evident feature: they are constant almost everywhere except at those points
where failures occur. In the limit, the empirical distribution function resembles, in a sense,
a devil's staircase prefractal. In the proposed multifractal approach the emphasis switches
to letting the statistical data "speak for themselves", rather than approximating the lifetime
distribution by one of the parametric models. In practice, we are often interested in
predicting the mean time of failure-free operation as well as other reliability
indices. Deeper insight into failures, their prediction and prevention can be gained by
using the multifractal approach in reliability engineering.
THE FRACTAL NATURE OF WOOD REVEALED BY DRYING
Nature provides us with a considerable variety of shapes and forms, from the simple
to the complex. There is much current interest in the application of fractal
geometry [1] to describe complex phenomena related to natural shapes and
structures exhibiting an intricate and irregular form. The degree of geometric
irregularity of an object, such as a coastline, a molecular surface, an aerogel, a
bacterial colony, a tumor cell, an axon terminal, or a tree crown, can be measured by
its fractal dimension. Wood is a typical porous material that
exhibits interconnected pores composed of biological elements. The pore space
consists chiefly of cell cavities, intercellular holes and voids in the cell wall, which
together shape the complex structural space of wood. Many reports have shown that the
pore space characteristics within wood bear a close relation to the water content, because the
structure and size of the pore space change when water evaporates and escapes
from wood. There are also some results on the relationships between the pore
space and the water content within wood. Most of them, however, did not take
into account the objective regularity and cannot reveal the complexity of wood
structure.

Introducing fractal theory into this field will provide new insights. The
fractal dimension of the wood surface was first studied by using water sorption. It was
found [2] that the fractal dimension was in the range of 2.5 to 2.8. An experiment [3]
with pressurized water absorption has confirmed the fractal nature of wood. The
findings [3] have shown that the pore space within wood can be characterized by a
fractal dimension or a set of fractal dimensions. But there was no explanation
leading to the characterization of wood properties.

A typical property of fractals relates their volume $V$ to the corresponding linear
size $L$, as

$$V(L) \propto L^{d_f}, \tag{1}$$

where $d_f$ is in general non-integer and $d_f < d$. Here $d$ is the Euclidean
dimension of the space in which the object is embedded [4]. This relation is familiar
to us when dealing with usual objects such as lines, discs, or spheres. The
mass change ($\Delta M$) of a sample, or the mass of bound water evaporated at different
temperatures ($T$), is a function of the linear dimension ($L$) of the sample and the
drying temperature ($\Delta M = \Delta M(L, T)$). As previously stated, the mass change
$\Delta M$ represents the volume of the pore space in the cell wall of the sample. Hence,
from Eq. (1), it can be expressed as

$$\Delta M(L, T) \propto L^{d_f(P)}, \tag{2}$$

where $d_f(P)$ is the fractal dimension of the pore space in the cell walls of a wood
sample. The constant of proportionality is not important to us; only the scaling
is important to the fractal nature of the structure of the pore space. Taking the natural
logarithm of both sides of Eq. (2), we have

$$\ln(\Delta M) = d_f(P)\,\ln(L) + \mathrm{const}.$$

If a set of $\Delta M$ and $L$ values is acquired, $d_f(P)$ can be obtained from a log-log plot of
the mass change $\Delta M$ against the linear dimension $L$.

Table 1. The fractal dimensions of Ginkgo and Chinese chestnut obtained at various temperatures

species            20°C    40°C    60°C    80°C    100°C
Ginkgo             2.106   2.547   2.851   2.863   2.876
Chinese chestnut   2.008   2.566   2.814   2.896   2.972
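
The log-log estimate of $d_f(P)$ can be sketched as a least-squares line fit. The code below is a minimal illustration, not the authors' procedure: the function name is hypothetical and the mass changes are synthetic values generated from an assumed exponent of 2.5, used only to show that the fit recovers the slope. The eleven edge lengths follow the 5 mm to 55 mm block sizes used in this paper.

```python
import numpy as np

def pore_fractal_dimension(L, dM):
    """Slope of ln(dM) versus ln(L), i.e. the exponent d_f(P) in dM ~ L**d_f(P)."""
    slope, _intercept = np.polyfit(np.log(L), np.log(dM), 1)
    return slope

# Edge lengths of the cubic blocks in mm: 5, 10, ..., 55 (11 blocks).
L = np.arange(5.0, 60.0, 5.0)

# Synthetic mass changes (arbitrary units) with an assumed exponent of 2.5;
# measured values of dM at one drying temperature would be used instead.
dM = 1e-4 * L ** 2.5

d_f = pore_fractal_dimension(L, dM)
print(round(d_f, 3))
```

Repeating the fit for the mass changes measured at each drying temperature would give one value of $d_f(P)$ per temperature, which is how a table such as Table 1 is organized.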

The materials for this investigation came from two species: one was a 37-year-old
plantation-grown Ginkgo (Ginkgo biloba) and the other was a 48-year-old
plantation-grown Chinese chestnut (Castanea mollissima). After the discs were
completely air-dried, they were sawn from pith to bark in the radial direction.
Then 11 cubic blocks were sawn near the center of the radial face of the discs and
sanded. The edge lengths of the cubic blocks increased in steps of 5 mm from
5 mm to 55 mm. The initial moisture content of the samples was 14%. The samples were
weighed and measured first. Then they were dried in an oven at temperatures of 20°C,
40°C, 60°C, 80°C, and 100°C, respectively. The duration of drying was four hours
in all cases. After each drying was completed, the samples were weighed and measured
again.

It was found that the fractal dimensions of the pore space of wood increase as
the temperature increases. The fractal dimension is an intrinsic property of the
pore space and can serve as a new parameter for the characterization of wood. The
fractal dimensions obtained in this paper are in the range of 2.1057 to 2.8757 and
2.0080 to 2.9718 for Ginkgo and Chinese chestnut, respectively. They reflect the
complexity and irregularity of the structure of both woods. The values obtained here
for the mass changes $\Delta M(L, T)$ are very sensitive to the nature of the volume of the
pore space, and the fractal dimensions should depend on the nature of the escaped
water used in the experimental procedure. Compared with the experiment with
water at a given pressure, the experimental procedure in this paper is simpler and
easier to control. Using this method, the geometric set composed of pores, voids,
and microvoids in wood can be described.
The main topic presented in this paper concerns the reduction of the diffusion volume to
convex isotropic Euclidean structures, such as polygons and convex polyhedra, in order to
estimate the diffusive acoustic behavior of a fractal structure.

The surface $S$ and the volume $V$ of a Euclidean structure are related to the perimeter $P$
by quadratic and cubic relationships, $S = F_q P^2$ and $V = F_c P^3$.

Their geometrical nature depends on the values of the prefactors in those relationships,
the shape factors $F_q$ and $F_c$ [1]. Extension of this relationship to the structure's
generalized volume $V_x$ [2] gives the equation:

$$V_x = F_x P^{d}. \tag{1}$$

Calculation of the corresponding shape factors gives $1/4\pi$ for a circular and
$3/32\pi$ for a spherical structure. In order to generalize this expression to the whole
Platonic 2-D family, we perform a polygonal quadrature of the circle with convex shapes
ranging from the square to $n$-sided polygons, $n \to \infty$.

The quadrature factor $Q$ represents the ratio between the areas of the circumscribed and
inscribed circles of the structural polygons of the analyzed objects. It varies
from $Q = 2$ for the square (minimal quadrature) to $Q = 1$ for the circle (total
quadrature). For a regular $n$-sided polygon with circumscribed radius $R_{ext}$ and
inscribed radius $r_{int}$, the general expression is given by:

$$Q = \frac{R_{ext}^{2}}{r_{int}^{2}} = \frac{n \tan(\pi/n)}{\frac{n}{2}\sin(2\pi/n)} \tag{2}$$

Taking into account the fractal dimension $D$ and the Euclidean dimension $d$ of the structure,
the expression of the $F_x$ factor becomes:

[Equation (3): expression for $F_x$ with denominator terms in $2^{d}$, $Q$ and $(d-1)$; not legible in the source.]

Eq. (3) gives the $F_x$ value of the square ($F_x = 1/16$), the cube ($F_x = 1/64$), the circle
($F_x = 1/4\pi$) and the sphere ($F_x = 3/32\pi$). It also gives the intermediate values for the regular
polygons that square the circle; note that for the dodecagon the prefactor $F_x$ already reaches
the circle value $F_x = 1/4\pi$. This confirms the convergence of the quadrature.

By replacing $F_x$ with its expression from equation (3), one can write the perimeter-surface
relationship ($P/S_x$) for irregular 2-D shapes and closed 2-D fractals, with
quadrature factor $Q$ and Euclidean dimension $d$, as follows:

$$S_x = F_x\,P^{D}, \tag{4}$$

with $F_x$ given by Eq. (3). We use the Minkowski analysis method to scrutinize the structure
with perimeter $P$ and real diffusion section $S_x$. The structuring element of radius $\Lambda$ that we
use gives the fractal dimension of the diffusion volume, depending on the perimeter $P$, the true
diffusion section $S_x$ and the structure's fractal dimension $D$, as follows [3]:

[Equation (5): relation involving $P\Lambda$ and $P$; not legible in the source.]


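This leads to the definition domain of the diffusion coefficient $\delta$, which evolves as [4]:

[Equation (6): $\delta$ in terms of $S_x(1-D)$, $2P\Lambda$ and the diffusion time; not legible in the source.]

where $\Delta t$ represents the diffusion time-share in the structure. This leads to the expression of
the diffusion time $T$, depending on the fundamental length of the acoustic diffusion process $\Lambda_{max}$
and the roughness parameter $n$ (number of indentations per unit length):

[Equation (7): $T$ in terms of $P^{d(1+D-d)-1}$, $n\Lambda^{2}$ and $f^{d-1}$; not legible in the source.]

By replacing the diffusion time $T$ with its expression (7), the diffusion coefficient
becomes:

[Equation (8): $\delta$ in terms of $F_x(1-D)$, $P^{d(1+D-d)-1}$, $n\Lambda^{2}$ and $f^{d-1}$; not legible in the source.]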


As equation (8) shows, this coefficient is defined only by the structural and
morphological diffusion parameters and the sound frequency $f$.

This result leads us to confirm that the diffusion coefficient is a geometry-dependent
parameter, confirming the role of structural complexity in acoustics.

Thus, after a multiscale renormalisation, the diffusion coefficient has to be spectrally
defined by adjusting a simulation procedure to experimental results, in order to
implement this diffusive acoustic model for any complex configuration. These
perspectives constitute the next step of our research work.
Chinese acupuncture-moxibustion is a therapeutic method of needling, or burning
moxa (a kind of herb) on, the acupuncture points (acupoints) of the body surface
in order to treat diseases. There are over one thousand acupoints on the body surface.
In the clinic, the acupoint system of the human body can be divided into two large
parts: the channel system (macro-acupuncture) and the micro-acupuncture system.
But the whole acupoint system can also be divided into many different hierarchical
holographic units. Such a system possesses a fractal structure [1].

We  found  that  the  acupuncture  holographic  units  can  be  divided  into  two  kinds. 
In  one,  their  acupoints  distribution  graphs  are  similar  to  the  shape  of  the  human 
nerve-embryo;  in  the  other  these  graphs  are  similar  to  the  shape  of  a  mature  fetus. 

In a 7-week-old human fetus (nerve-embryo, 1.7 cm, 2 g), organ differentiation
has already begun, as has limb growth. Its 'body segment' development is well
advanced. The whole nerve-embryo can be divided into many 'body segments'. Each
of these contains a 'nerve segment', a dermatome of skin (including acupoints) and
the corresponding organ. The nerve segment controls the skin and the organ in the
same body segment.

The whole channel system is an acupuncture holographic unit. On the trunk, the
channels run parallel to the spinal column. The acupoints in a
channel are distributed at equal distances. The distribution of channel acupoints
possesses a segmental structure. Anatomical results show that there
is a good correspondence between the distribution of acupoints on the body surface and
that of the endings of the spinal nerves. The channel acupoints receive segmental
innervation from the spinal nerves. Although the extremities have no segmental structure, the
channel acupoints on the extremities still receive segmental innervation from the
spinal nerves. Therefore, the distribution and innervation of the acupoints of the
channel system retain the segmental structure. The channel system is an acupuncture
holographic unit which is similar to the nerve-embryo.

The  back  acupoint  system  on  the  trunk  is  part  of  the  channel  system.  It  is  an 
acupuncture  holographic  unit  too.  The  acupoints  distribution  retains  the  segmental 
structure.  Its  acupoints  receive  segmental  innervation  of  the  spinal  nerve.  Thus, 
the  back  acupoint  system  is  similar  to  the  nerve-embryo. 

There is the same micro-acupuncture system in each different hierarchical part of
the human body [2]. The acupoint distribution graphs of these micro-acupuncture
systems retain a segmental structure. They are similar to the shape of the
nerve-embryo. But the acupoints in these systems receive suprasegmental innervation.

On the other hand, in some acupuncture units the acupoint distribution
graphs resemble the shape of a fetus. It was found [3] that the acupoint distribution
graph of the ear is just like an inverted fetus in a womb. The acupoints of organs
originating from the same germinal layer (entoderm, ectoderm or mesoderm) are
distributed over the same area of the ear. The acupoints in the ear system receive
suprasegmental innervation from the nervous system.

The acupoint distribution graphs of the face- and nose-acupuncture systems
are similar to the mature fetus, but with deformations. They are copies of
the shape of the mature fetus that are compressed along different directions in
different proportions. The acupoints of the face- and nose-acupuncture systems receive
suprasegmental innervation.

There are many different hierarchical acupuncture holographic units of the channels
on the extremities. Their acupoint distribution graphs are similar to the shape
of a mature fetus, with deformation.

To sum up, the acupuncture holographic units can be divided into two groups:
their acupoint distribution graphs are similar in shape either to the nerve-embryo
or to a mature fetus.

It is well known that a self-affine system can be divided into many parts, each
of which is a copy of the whole that is compressed along different directions in
different proportions. We have found that this concept can be used to describe the
acupuncture holographic unit system made of units similar to the shape of a mature
fetus. Each unit in this system is a copy of the mature fetus that develops along
different directions in different proportions. Their acupoint distribution graphs
are similar to the shape of a mature fetus, but with deformations.

The structure of a self-affine system generator $T$ is

$$T(F) = S_1(F) \cup \ldots \cup S_m(F) \tag{1}$$

$$S_i(F) = \{A_i x + a_i : x \in F\} \tag{2}$$

$$A_i = (\text{a rotation matrix}) \times \mathrm{diag}(\lambda_{i1}, \ldots, \lambda_{id}) \tag{3}$$

in which $0 < \lambda_{i1}, \ldots, \lambda_{id} < 1$. Here $\lambda_{il}$ is the development factor of the $i$-th part
of the whole along the $l$-th direction. The change defined by Eqs. (2) and (3) is a
self-affine change.
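
To make Eqs. (1) to (3) concrete, the following sketch applies a two-map self-affine generator to a finite point set in the plane ($d = 2$). It is only an illustration under stated assumptions: the function names, the rotation angles, the development factors and the shift vectors are all hypothetical values chosen for the example and are not taken from any acupoint data.

```python
import numpy as np

def affine_map(theta, scales, shift):
    """Return S(x) = A x + a with A = R(theta) @ diag(scales), as in Eqs. (2)-(3)."""
    c, s = np.cos(theta), np.sin(theta)
    A = np.array([[c, -s], [s, c]]) @ np.diag(scales)
    a = np.asarray(shift, dtype=float)
    # Points are stored as rows, so A x becomes pts @ A.T.
    return lambda pts: pts @ A.T + a

def generator(maps, pts):
    """T(F) = S_1(F) U ... U S_m(F) for a finite point set F (rows of pts)."""
    return np.vstack([S(pts) for S in maps])

# Two hypothetical maps with different development factors along each axis.
maps = [affine_map(0.0, (0.5, 0.3), (0.0, 0.0)),
        affine_map(0.0, (0.5, 0.6), (0.5, 0.4))]

# F: corners of the unit square, standing in for the outline of the whole.
F = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
F1 = generator(maps, F)
print(F1)
```

Each map produces a copy of the whole compressed by different factors along the two axes; choosing equal factors in every direction would give the self-similar special case.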

We  have  found  that  the  concept  of  a  self-similar  system  can  also  be  used  to 
describe  the  acupuncture  system.  This  will  happen  if  d  =  1  in  the  above  equations. 

There  is  a  close  relationship  between  the  acupuncture  holographic  units  and 
fetus  development.  The  fractal  theory  provides  a  tool  to  describe  the  two  systems. 
It is common knowledge that both the solar plasma and the solar magnetic field
have a cellular, discrete structure, which is indicative of their possible fractal origin.
The questions of the fine structuring of the solar magnetoplasma were considered in [1],
where plausible mechanisms of its genesis were suggested and possibilities of describing
its fractal properties were put forward. The existence of a whole hierarchy of sizes
of active features in the solar atmosphere has long been established. For example,
it is well known that in addition to the granulation (1"-1.5"), such structures as
the mesogranulation (8"-15"), the supergranulation (30"-40"), giant cells (20°-40°),
and supergiant cells (80°-110°) exist on the Sun. The last two features refer to the
so-called large-scale organization of solar magnetic fields [2].

The  objective  of  this  paper  is  to  investigate  the  fractal  properties  of  large-scale 
magnetic  fields  on  different  spatial  scales. 

This  study  is  based  on  using: 

1. Synoptic maps of large-scale magnetic fields compiled by Stenflo [3] from maps
of small-scale magnetic fields, using data from the Mt. Wilson observatory
(17" resolution) from 1959 to 1976 and from Kitt Peak (1" resolution) from
1959 to 1976. The longitudinal-latitudinal grid covers the range from 0° to 360°
in longitude $\lambda$ with steps $\Delta\lambda = 10°$, and from -90° to 90° in latitude $\varphi$
with 30 zones uniform in the sine of latitude $\varphi$.

2. Synoptic maps of large-scale magnetic fields obtained [4] at the Wilcox
observatory (180" resolution) from 1976 to 1998. The longitudinal-latitudinal grid
covers the range from 0° to 360° in longitude $\lambda$ with steps $\Delta\lambda = 5°$, and
from -75° to 75° in latitude $\varphi$ with 30 zones uniform in the sine of latitude $\varphi$.

These longitudinal-latitudinal grids were used in the calculations in this paper. The
method of scaling the variance of temporal and spatial series was used to calculate
the fractal dimension of the maps [5]. The method involves calculating the variance
$V(s)$ of a given quantity in each cell with area $s$. The dependence of the variance $V(s)$
on the cell size $s$ is plotted on log-log coordinates; ranges that admit a linear
approximation are identified, where possible, as $V(s) \sim s^{\alpha}$,
and the coefficients $\alpha$ of this approximation are determined by the method of least
squares. The coefficients $\alpha$ are associated with the Hurst exponent $H = \alpha/2$, as
was established by testing model temporal and spatial series with known $H$.
The fractal dimension of the magnetic field distribution on a given two-dimensional map
is $D = 3 - H$, where 3 is the dimension of the space.
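
A minimal sketch of the variance-scaling estimate is given below. Several points are assumptions made for illustration and may differ from the procedure of [5]: the map is a plain rectangular array, $s$ is taken as the linear cell size in grid points, and $V(s)$ is taken as the within-cell variance averaged over all cells. The function names and the test field (a linear ramp, i.e. a smooth surface) are hypothetical.

```python
import numpy as np

def mean_cell_variance(field, s):
    """Average, over all s-by-s cells, of the variance of the field inside a cell."""
    ny, nx = field.shape
    ny_c, nx_c = ny // s, nx // s
    trimmed = field[:ny_c * s, :nx_c * s]            # drop incomplete cells
    blocks = trimmed.reshape(ny_c, s, nx_c, s).swapaxes(1, 2)
    return blocks.reshape(ny_c, nx_c, s * s).var(axis=2).mean()

def variance_scaling(field, sizes):
    """Fit V(s) ~ s**alpha on log-log axes; return alpha, H = alpha/2, D = 3 - H."""
    v = [mean_cell_variance(field, s) for s in sizes]
    alpha, _intercept = np.polyfit(np.log(sizes), np.log(v), 1)
    H = alpha / 2.0
    return alpha, H, 3.0 - H

# Smooth test surface (linear ramp): the expected result is H close to 1, D close to 2.
ramp = np.tile(np.arange(64.0), (64, 1))
alpha, H, D = variance_scaling(ramp, [4, 8, 16, 32])
print(alpha, H, D)
```

On real synoptic maps the fit would be restricted to the range of cell sizes over which the log-log plot is approximately linear, as the text describes.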

The scaling of the variance of magnetic field strengths in Stenflo's maps
and in the maps from the Wilcox observatory was investigated by this method
($s = \Delta\varphi \times \Delta\lambda$). The study has shown that the spatial distribution of large-scale
magnetic fields on the Sun reveals a fractal character. This manifests itself in
the power-law scaling of statistical characteristics of magnetic fields such as the
variance. Also, two typical ranges of spatial scales are distinguished, which appear to
characterize two systems of solar large-scale magnetic fields organized into giant
($\Delta\varphi \times \Delta\lambda \lesssim (25° \pm 5°) \times (45° \pm 5°)$) and supergiant cells
($\Delta\varphi \times \Delta\lambda \gtrsim (25° \pm 5°) \times (45° \pm 5°)$). The fractal dimensions of the space of the
strength function of magnetic fields show a high degree of irregularity and alternation on
separate synoptic maps. Time variations of the exponents corresponding to the fractal dimension
show a cyclic character, which testifies to a change of the statistical and associated
physical properties of the solar magnetoplasma with the phase of the cycle.
FRACTAL ANALYSIS OF TIDE GAUGE DATA

N.K.  INDIRA 

CSIR  Centre  for  Mathematical  Modelling  and  Computer  Simulation.  NAL  Belur  Campus 

Bangalore  -  560  037,  India 

One of the important consequences of global warming is the general rise in mean sea
level. There have been several recent reviews of sea level rise and the consequences
of climate change. Long-term changes of mean sea level are called secular changes.
The global changes in mean sea level are called eustatic changes, whereas
the vertical movements of the land are called epeirogenic movements. Of these
causes of sea level rise, only the eustatic rise is universal. It also appears that
tide gauge records contain long-period fluctuations (5-100 years), which make
the accurate extrapolation of small sea level rise values from the data very
difficult. For nearly a century, relative mean sea level has maintained a steady
rise at many tide gauge stations around the world. The relative mean sea level
change at a particular location is the difference between the eustatic change and any
local change in land elevation [1,2].

The weather and climate system has low-dimensional attractors. A consistent feature
of weather and climate data is that they are aperiodic, and their deviations
from periodicity cannot be explained by conventional linear models of time series
analysis.

It is therefore essential that the nature of the sea level change be analysed and
modelled in order to assess how the changes differ from one coastline to another, or
between one ocean and its neighbouring oceans. In this
study such a comparison is analysed and visualized by
calculating the fractal dimension of the time series of annual averages of mean
sea level for different stations in four countries, representing two different
oceans regionally.

The tide gauge data for coastal stations around the world are maintained by
the Permanent Service for Mean Sea Level (PSMSL) at their site. The annual
tide gauge data were obtained through ftp for analysis. Five coastal stations
in India, namely Bombay, Cochin, Madras, Vishakhapatnam and Sagar, were
considered [4] to estimate the fractal dimension of the tide gauge data. Using the same
technique for interpolating the data points between given observed points, namely
the IFS technique, the fractal dimensions were calculated for other coastal stations in
India for which longer time series were available. This was done as an additional
exercise to check the variation of the fractal dimension along the
coasts of India. For the construction of the fractal interpolation algorithm, consider
a set of data $\{(x_i, z_i);\ i = 0, \ldots, N\}$ and construct an IFS in $\mathbb{R}^2$ such that its
attractor, which is the graph of a continuous function $f : [x_0, x_N] \to \mathbb{R}$, interpolates the
data. The IFS technique is described in detail elsewhere [3]. Consider the IFS of the
form $\{\mathbb{R}^2;\ \omega_n,\ n = 1, \ldots, N\}$, where the maps are affine transformations

$$\omega_n \begin{pmatrix} x \\ z \end{pmatrix} = \begin{pmatrix} a_n & 0 \\ c_n & d_n \end{pmatrix} \begin{pmatrix} x \\ z \end{pmatrix} + \begin{pmatrix} e_n \\ f_n \end{pmatrix}.$$

This is constrained by the data

$$\omega_n \begin{pmatrix} x_0 \\ z_0 \end{pmatrix} = \begin{pmatrix} x_{n-1} \\ z_{n-1} \end{pmatrix} \quad \text{and} \quad \omega_n \begin{pmatrix} x_N \\ z_N \end{pmatrix} = \begin{pmatrix} x_n \\ z_n \end{pmatrix},$$

from which we get

$$a_n = (x_n - x_{n-1})/(x_N - x_0),$$

$$e_n = (x_N x_{n-1} - x_0 x_n)/(x_N - x_0),$$

$$c_n = (z_n - z_{n-1})/(x_N - x_0) - d_n (z_N - z_0)/(x_N - x_0),$$

$$f_n = (x_N z_{n-1} - x_0 z_n)/(x_N - x_0) - d_n (x_N z_0 - x_0 z_N)/(x_N - x_0),$$

where $d_n$ is any real number. If $\sum_{n=1}^{N} |d_n| > 1$, the fractal dimension is the unique
real solution $D$ of

$$\sum_{n=1}^{N} |d_n|\, a_n^{D-1} = 1.$$
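
The coefficient formulas and the dimension equation above can be sketched as follows. This is an illustrative implementation under assumptions, not the code used in the study: the function names are hypothetical, the data points and vertical scaling factors $d_n$ are made-up values, the restriction $|d_n| < 1$ is the usual condition from the fractal interpolation literature rather than something stated here, and the dimension equation is solved by simple bisection on $[1, 2]$.

```python
import numpy as np

def ifs_coefficients(x, z, d):
    """Coefficients a_n, c_n, e_n, f_n of the affine maps, for n = 1..N."""
    x, z, d = (np.asarray(v, dtype=float) for v in (x, z, d))
    span = x[-1] - x[0]                                   # x_N - x_0
    a = (x[1:] - x[:-1]) / span
    e = (x[-1] * x[:-1] - x[0] * x[1:]) / span
    c = (z[1:] - z[:-1]) / span - d * (z[-1] - z[0]) / span
    f = ((x[-1] * z[:-1] - x[0] * z[1:]) / span
         - d * (x[-1] * z[0] - x[0] * z[-1]) / span)
    return a, c, e, f

def fractal_dimension(a, d, tol=1e-12):
    """Solve sum_n |d_n| * a_n**(D - 1) = 1 for D by bisection on [1, 2]."""
    d = np.abs(np.asarray(d, dtype=float))
    if np.any(d >= 1.0):
        raise ValueError("assumes |d_n| < 1 for every map")
    if d.sum() <= 1.0:
        return 1.0              # assumption: the graph then has dimension 1
    lo, hi = 1.0, 2.0           # left side is > 1 at D = 1 and < 1 at D = 2
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if np.sum(d * a ** (mid - 1.0)) > 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical data: three points and two maps with d_n = 0.75.
x = np.array([0.0, 1.0, 2.0])
z = np.array([0.0, 1.0, 0.0])
d = np.array([0.75, 0.75])
a, c, e, f = ifs_coefficients(x, z, d)
D = fractal_dimension(a, d)
print(D)
```

For equally spaced data with equal factors the equation has the closed form $D = 1 + \ln\left(\sum_n |d_n|\right)/\ln N$, which gives a convenient check on the numerical solution.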

It was found that the fractal dimensions calculated for Kandla, Calcutta
(Garden Reach), Diamond Harbour and Tribeni fall in the range 1.2 to 1.3, as
obtained earlier [4]. It can be said that the fractal dimensions obtained for sea level
variation along the coasts of India fall in the range between 1.2 and 1.3.

The natural question now is whether such behaviour is also seen in other nearby
oceans, or whether there is any difference in the behaviour of the ocean that may be
reflected in the value of the fractal dimension. In the immediate neighbourhood,
the tide gauge data for the Singapore coastline were analysed. The three stations
considered, Jurong, Sembawang and Sultan Shoal, showed
a similar range of fractal dimensions. On the coast of Thailand, the three stations
Fort Ph. Chimkalao, Ko Lak and Ko Sichang show fractal dimensions varying
between 1.2 and 1.3, which is similar to the results obtained earlier.

To check and compare the fluctuations of sea level with respect to their fractal
dimensions from one ocean to another, the coastal stations of China, representing
the Pacific Ocean, were considered. The four stations in China, namely Kanmen,
Quihuan, Xiamen and Zhapo, when analysed through the IFS technique give results
very similar to those obtained earlier.

The fractal dimensions of sea level data for both the Indian and Pacific Oceans were
obtained and found to lie between 1.2 and 1.3. These non-integer values of the
dimensions suggest that a small number of variables represents the system.